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Abstract. We study the Cauchy initial-value problem for the Benjamin-Ono equation in 
the zero-disperion limit, and we establish the existence of this limit in a certain weak sense 
by developing an appropriate analogue of the method invented by Lax and Levermore to 
analyze the corresponding limit for the Korteweg-de Vries equation. 



1. Introduction 



The Benjamin-Ono (BO) equation 



(1) 




0, X G M, t > 0, 



where e > is a constant and H. is the Hilbert transform operator defined by the Cauchy 
principal value integral 



is a model for weakly nonlinear dispersive waves on the interface between two ideal immis- 
cible fiuids, one of which may be considered to be infinitely deep. Applications include the 
modeling of internal waves in deep water [21 [TJ |29l H], and also the modeling of atmospheric 
waves like the dramatic "morning glory" phenomenon of northeastern Australia [28]. The 
relevant Cauchy problem is to determine the solution Ue{x,t) of ([T]) subject to a suitable 
initial condition Ue{x, 0) = uq{x) given for all x G M. 

The parameter e > is a measure of the relative strength of the dispersive and nonlin- 
ear effects in the system. In many applications one thinks of e as a small parameter in 
part because numerical experiments show that in this situation the finite-time formation of 
a shock wave (gradient catastrophe) in the formal limiting equation (obtained simply by 
setting e = in ([T])) is dispersively regularized by the generation of a smoothly modulated 
train of approximately periodic traveling waves, which correspond to so-called undular bores 
frequently observed in the evolution of physical internal waves. Snapshots from the solution 
of a Cauchy problem for ([T]) illustrating the averted shock and onset of an undular bore 
are shown in Figure [Tj These figures clearly show that the mathematical description of the 
undular bore consists of waves of amplitude independent of e and wavelength approximately 
proportional to e. We refer to the asymptotic analysis of the solution of the Cauchy problem 
with e- independent initial data Uq{x) in the limit e J, as the zero- dispersion limit. 



(2) 
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Figure 1. The evolution of a pulse under the BO equation. Top row: e = 
0.04. Bottom row: e = 0.02. In both cases the initial condition is the same: 



1.1. A related problem and its history. A more famous nonlinear dispersive wave equa- 
tion is the Korteweg-de Vries (KdV) equation 

a model for long surface waves on shallow water among a wide variety of other physical 
phenomena. When e > is small, this equation displays qualitatively similar behavior to 
that just illustrated for the BO equation: the dispersive term arrests the shock in the e = 
equation with the formation of a train of waves of amplitude approximately independent of 
e and wavelength proportional to e. 

The modeling of the zero-dispersion limit for the KdV equation has a long history going 
back to the work of Whitham [31] who used the method of averaging to propose a nonlinear 
hyperbolic system of three partial differential equations to describe the modulational vari- 
ables {e.g. slowly- varying amplitude, mean, and wavelength of the wavetrain). Whitham 
noted that the system of modulation equations he obtained had the nongeneric property 
that by choice of special dependent variables v^, v"^, and v^, it could be written in so-called 
Riemann invariant form, in which the three equations are only coupled through the charac- 
teristic velocities: 

(4) |^ + ^>''^''^')|^=°' ^ = ^'2' 3- 

Later, Gurevich and Pitaevskii [13] considered the problem of patching together solutions 
of Whitham's modulational system with solutions of the formal limiting equation (obtained 
by setting e = in (p|)) at two moving boundary points that delineate the oscillation zone; 



their goal was to provide a reasonable global approximation scheme for the solution of the 
initial-value problem for the KdV equation ([s]) in the zero-dispersion limit subject to given 
initial data 0) = Vo{x) independent of e. 

In the meantime, it was of course discovered that the KdV equation is a completely 
integrable system, posessing a compatible structure now called a Lax pair and a coincident 
solution procedure for addressing the Cauchy (initial-value) problem: the inverse- scattering 
transform. This development suggested that the methodology invented by Whitham could 
perhaps be placed on completely rigorous mathematical footing. After the exact periodic 
(and quasiperiodic) solutions of the KdV equation ^ were given a spectral interpretation by 
Its and Matveev [15] and Dubrovin, Matveev, and Novikov [llj, the Whitham modulation 
equations themselves were reinterpreted within the framework of integrability by Flaschka, 
Forest, and McLaughlin [I2j. (In particular this work made clear the reason why Whitham's 
equations could be placed in Riemann invariant form; it is a consequence of integrability.) 

The task that remained in the use of integrable machinery to study the zero-dispersion 
limit of the KdV equation was to rigorously analyze the Cauchy problem using the inverse- 
scattering transform. The first step in this program was taken by Lax and Levermore |20j 
who considered positive initial data vo{x) rapidly decaying to zero for large They used 
WKB methods to argue that the Schrodinger operator with potential —vo{x) that arises in 
the scattering theory is approximately reflectionless in the limit e J, 0. On an ad-hoc basis 
they replaced the true scattering data by its WKB analogue, retaining only contributions 
from a set of N{e) ~ discrete eigenvalues which are approximated by a Bohr-Sommerfeld 
quantization rule, which amounts to replacing the solution v^{x,t) of the Cauchy problem 
with another solution Ve{x,t) of ([s]) having e-dependent initial data close to vq. In this 
situation, the inverse-scattering procedure reduces to finite-dimensional (of dimension N{e)) 
linear algebra, and in fact the solution obtained by Cramer's rule can be reduced to the 
determinantal formula 

(5) v,{x,t) =2e^—\og{T{x,t)), r(x,t) =det(I + G(a;,t)), 

where G{x,t) is a positive-definite real symmetric matrix of dimension A^(e) x A^(e). Lax and 
Levermore then established the existence of the limit of 26^ log(r(x, t)) as e J, 0, uniformly 
on compact subsets of the (x, t)-plane. This yields weak convergence of v^{x,t) by differen- 
tiation of the limit function with respect to x. The Lax-Levermore method is to expand the 
determinant T{x,t) in principal minors indexed by subsets of the set of eigenvalues; noting 
that each term is positive they showed that the sum of terms is asymptotically dominated 
by its largest term, and they further approximated this discrete optimization problem with 
an e-independent (limiting) convex variational problem, explicitly parametrized by x and t, 
for measures. The weak zero-dispersion limit of the Cauchy problem for the KdV equation is 
therefore encoded implicitly in the solution of this variational problem. The Lax-Levermore 
method reproduces the specified initial data vo{x) at t = as e J, 0, which estabhshes validity, 
in a certain sense, of the WKB-based spectral approximation procedure in the first step. 

Later, Venakides [30j was able to extend the method of Lax and Levermore to higher order, 
capturing the form of the oscillations that are averaged out in the weak limit. This work at 
last made clear that the solution of the Cauchy problem for the KdV equation with smooth, 
e-independent initial data vo{x) really does generate after some fixed breaking time a train of 
high-frequency waves of exactly the kind originally considered without complete justification 

3 



by Whitham. More recently, the powerful steepest-descent method for matrix Riemann- 
Hilbert problems developed by Deift and Zhou was used to analyze the zero-dispersion 
limit for the KdV equation [8j. This technique is best viewed as a tool for converting weak 
asymptotics (the solution of the Lax-Levermore variational problem) into strong asymptotics 
(an improvement of the Venakides asymptotics in which the phase of the waveform is accurate 
to very high order). 

1.2. The zero-dispersion limit of the Benjamin-Ono equation. It turns out that the 
BO equation ([T]) is also an integrable equation, in the sense that it has a representation as 
the compatibility condition of an overdetermined Lax pair of linear problems [3]. In fact, 
both BO and KdV equations may be viewed as limiting cases (as depth of a fluid layer 
tends to infinity and zero, respectively) of the so-called intermediate long-wave equation 
|18j . itself an integrable system for arbitrary layer depth. However, the integrable structure 
of the BO equation is markedly different from that of the KdV equation. In particular, the 
nonlocality in the equation due to the presence of the Hilbert transform is mirrored in a 
certain nonlocality of the scattering and inverse-scattering problems. In place of the spectral 
theory of the selfadjoint Schrodinger (Sturm-Liouville) differential operator one has to work 
with the spectral theory of the nonlocal operator 

Here, the operator C+ is the selfadjoint orthogonal projection from L^(]R) onto the Hardy 
space of the upper half-plane, the Hilbert space on which L is selfadjoint, and denotes 
the operator of multiplication by Me(-,t). 

Certainly a key step forward in the theory of the zero-dispersion limit was taken by 
Dobrokhotov and Krichever [9] who noted that the second (time evolution) equation in 



the Lax pair for the BO equation (see (30) below) is simply a time-dependent Schrodinger 
equation whose potential is a function with an analytic continuation from the real x-axis 
into the upper half-plane and were able to adapt a pre-existing construction of "integrable" 
potentials for this equation to the appropriate Hardy-space setting. This allowed them to 
construct, from the Lax pair, a large family of periodic traveling wave solutions of the BO 
equation ([T]), along with quasiperiodic generalizations. Remarkably, unlike the corresponding 
exact solutions of the KdV equation (|3]) which are highly transcendental objects constructed 
from Riemann theta functions of hyperelliptic curves of arbitrary genus, the periodic and 
quasiperiodic solutions of the BO equation turn out to be simple rational functions of P 
exponential phases ^^i^i^-^j^)/^ _ in the same paper, Dobrokhotov and Krichever also carried 
out for the BO equation the analogue of the calculation of Flaschka, Forest, and McLaughlin 
|12j . deriving by multiphase averaging a system of equations governing the modulational 
variables for a slowly-varying train of P-phase waves. Here we arrive at a second remarkable 
fact: not only can the modulation equations be written in Riemann invariant form, they are 
completely diagonal: 

(7) 

(the case of P = 1 corresponds to simple traveling waves). This again should be contrasted 
with the situation for the KdV equation in which the characteristic velocities not only provide 



coupling among the fields but also are transcendental functions of the fields written in terms 
of ratios of complete hyperelliptic integrals. 

The analogue for the BO equation of the matching procedure developed by Gurevich and 
Pitaevskii [H] to describe the evolution of a dispersive shock was independently described by 
Matsuno [211 and by Jorge, Minzoni, and Smyth [12]. This matching procedure provides 
a reasonable approach to the Cauchy problem for the Benjamin-Ono equation ([T]) with fixed 
initial data Ue{x,0) = uo{x) when e ^ 1, but it is based on formal asymptotics. In [24j . 
Matsuno writes: 

From a rigorously mathematical point of view, however, the various results 
presented in this paper should be justified on the basis of an exact method of 
solution such as [the inverse-scattering transform], or an analog of the Lax- 
Levermore theory for the KdV equation. 

It is our intention in this paper to provide exactly such a justification, by developing a new 
method that does for the BO Cauchy problem exactly what the Lax-Levermore method does 
for the KdV Cauchy problem. 

The main result of our analysis is remarkably easy to state, but first we need to recall 
some basic facts concerning the equation obtained from ([T]) simply by setting e = 0. Recall 
that while for general sufficiently smooth initial data u^{x,0) = uq{x) the inviscid Burgers 
equation 

does not have a global solution as a function due to gradient catastrophe (shock formation) 
in finite time, it does have a global solution as a real multi-sheeted surface over the [x, t)- 
plane; indeed this is the construction of the method of characteristics. The sheets of this 
surface are obtained as the real solutions of the implicit equation 

(9) = uo{x -2uh), 

and by implicit differentiation it is easy to verify that away from singularities each sheet 
of the surface is a function = u^{x,t) that satisfies ([s]). A simple consequence of the 
Implicit Function Theorem is that for sufficiently small \t\ there is a unique solution of ^ 
for all a; e M. New sheets of the multivalued solution are born from breaking points in the 
{x, t)-plane that are in one-to-one correspondence with generic infiection points ^ of Uq for 
which u'q{^) 7^ but u'^{^) = 0. If ,^ G M is such a point, then the corresponding breaking 
point is given by 

Each such breaking point is the location of a pitchfork bifurcation for with respect to 
t holding X — 2uo{C,)t = ^ fixed, with two new branches emerging as \t\ increases. Thus, 
assuming that Uq is a bounded function of total integral zero, the solution of the Cauchy 
problem for ([8| is classical for 



(11) T_ := — - < t < — - =: T4 

2max^^uUQ[x) 2 mm^jeR ^(a;) 
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Note that under our assumptions on u'q we have T_ < < T+. Also, T_ is the supremum of 
all < while T+ is the infimum of all > 0. When we consider the Cauchy problem for 
t > 0, we will refer to T := T+ as the breaking time. 

For t/t^ > 1 there are caustic curves x'^{t) < x^{t) with limiting values as t — )■ given 
by x'^{t^) = x^(t^) = x^ that bound the triply-folded region emerging from {x^,t^). The 
caustic curves correspond to double roots of and crossing one of them at a generic point 
results in a change in the number of sheets by exactly two. Except along the union of the 
caustic curves and the breaking points from which they emerge, the number of solutions of 
(|9| is always odd, and all are simple roots. See Figure [2] 
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Figure 2. Except along the caustic curves x = x^{t) and x = x^{t) the 
number of solutions of ^ is of the form 2P + 1, and these solutions are 
simple roots. For this figure, uo{x) := 2(1 + x^)~^ . 

For the initial data uo{x) = 2(1 + used in Figure [T| the breaking time before which 

there is a unique solution for all x G M and after which there is an expanding interval in 
which there are three solutions, is exactly T = 2-\/3/9 ~ 0.3849. Snapshots of the evolution 
of the multivalued solution of ([s]) for this initial data are shown in Figure |3] Our result is 
then the following. 

Theorem 1.1. Let u^{x.,t) < uf{x,t) < ■■■ < U2P{xt)i^'^) branches of the multival- 

ued (method of characteristics) solution of the inviscid Burgers equation ([s]) subject to an 
admissible initial condition u^{x,0) = Uo{x). Then, the weak L^(]R) (in x) limit of u^{x,t) 
is given by 

2P{x,t) 

(12) Wx-\imu,{x,t) = V (-lfM^(a;,t), 

uniformly for t in arbitrary bounded intervals. Note that the right-hand side extends by 
continuity to the caustic curves. 

The signed sum of branches that is the weak limit is illustrated with red curves in Figure |3] 
for the same initial data as in Figure [l] Of course convergence in the weak L^(M) (in x) 
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Figure 3. The multivalued solution (black) of ^ and the signed sum of 
branches (red) corresponding to Uq{x) = 2(1 + x^)^^. Left: t = 0. Middle: 
t = 1. Right: t = 2. Before the breaking time as well as afterwards but 
outside the oscillation interval there is only one solution branch and hence no 
difference between the red and black curves. 



topology means that for every v G L^(]R), we have 



(13) 



hm / uJx, t)v(x) dx 



2P(x,t) 
J2 



n=0 



v{x) dx 



with the hmit being uniform with respect to t in arbitrary bounded intervals. Thus, the weak 
limit essentially smooths out the rapid oscillations seen in Figure [l] and (if we think of v as 
the indicator function of a mesoscale interval) represents a kind of local average in x. What 
it means for an initial condition to be admissible will be explained later (see Definition 3.1). 
Here Ueix,t) is not exactly the solution of the Cauchy problem for the BO equation ([Ij) 
with fixed initial data Uq{x), but it is for every e > an exact solution of ([T]) that satisfies 
an e-dependent initial condition that converges (in the strong sense, see Corollary |l.l| 
below) to 'Uo(') as e J, 0. See Definition 3.2 for more details. This modification of the initial 



data is an analogue of the replacement of the true scattering data by its reflect ionless WKB 
approximation in the Lax-Levermore theory. 

For t before the breaking time T for Burgers' equation, the weak limit guaranteed by 
Theorem 1.1 may be strengthened as follows. 



Corollary 1.1. Suppose that < t < T , so that P{x,t) = for all x E (that is, the 
solution = u^{x,t) of Burgers' equation with initial data Uo{x) is classical). Then 

(14) 

with the limit being in the (strong) L^(l 



limWe 

€4,0 



>,t) 



{x,t) 



topology. 



It should be pointed out that the weak limit formula (12) is much more explicit than the 
corresponding formula found by Lax and Levermore [2D] for the weak zero-dispersion limit 
of the Cauchy problem for the KdV equation. Indeed, the latter requires the solution, for 
each X and t, of a constrained functional variational problem, which can be solved in closed 
form only for the simplest initial data. As we will now see, there are several classical wave 
propagation problems whose asymptotic behavior can be reduced to the multivalued solution 
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of Burgers' equation; however, even these simple problems involve more complicated schemes 



for combining the solution branches than that exhibited in the simple formula (12). 

After we introduce the necessary framework for our study (the inverse-scattering transform 
for the BO equation) in ^ we will analyze the direct scattering map in the zero-dispersion 



limit in q3j Then we will prove Theorem 1.1 in §4^ by carrying out a detailed analysis of 



the inverse scattering map applied to the asymptotic formulae for scattering data obtained 
in ^ In ^ we prove Corollary |1.1[ and then in ^ we will illustrate our results with 
numerical calculations and address the relation between Ue{x, t) and the true solution Ue{x, t) 
of the Cauchy problem for the BO equation with initial data uq. Some comments about our 
continuing work can be found in the conclusion, ^ But before we embark on our study of 



the BO equation, we pause to consider some familiar analogues of Theorem 1.1 



1.3. Elementary examples. The key role played in the zero-dispersion limit of the BO 
Cauchy problem by the multivalued solution of the equation ([s]) with the same initial data 
is reminiscent of two basic example problems from the theory of linear and nonlinear waves. 

1.3.1. The zero-viscosity limit of the viscous Burgers equation. The Burgers equation with 
viscosity e > is the nonlinear wave equation 

(15) _ + 2.,__,_.0. .eE, t>0 

and we take fixed initial data We{x,0) = uo{x). As is well-known, this Cauchy problem is 
solved by the Cole-Hopf transformation, leading to the exact solution formula 



(16) w,{x,t) = —^^^-p , t>0 



2t 



where the exponent function is defined as 
(17) Ri^;x,t) := - / Uo{r])dri 







At 



One examines the asymptotic behavior in the limit e J, by using Laplace's method to 
analyze the integrals (see §3.6). The dominant contributions to the integrals come from 
neighborhoods of points ^ = ^(a;, t) G M at which i?(^; x, t) achieves its maximum value. The 
critical points of R satisfy ^ = x — 2Mo(^)t. Writing = uo{0 applying uq to both 
sides gives the equation (|9|, so the critical points correspond to the sheets of the multivalued 
solution of the (inviscid) Burgers equation ([s]) with initial data uq. It is easy to check that 
if X and t are such that there is just one sheet, then the unique critical point is the global 
maximizer of R and Laplace's method gives the result that w^{x,t) converges (strongly, 
pointwise in x and t) to u^{x, t). On the other hand, if there are 2P -|- 1 > 1 sheets, then for 
generic {x,t) exactly one of them corresponds to the global maximum of R, and Laplace's 
method predicts that w^{x,t) will converge to the maximizing sheet. Shocks appear in the 
small viscosity limit as curves in the {x, t)-plane along which there are jump discontinuities 
of the pointwise limit corresponding to sudden changes in the choice of sheet that maximizes 
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the exponent R. To summarize, we have the formula 

(18) lim We(x, t) = ^^^(x, t), n= argmax R{x — 2u^{x , t)t; x , t) 

40 m=0,...,2P(x,t)+l 

for {x, t) not on a shock. 

Thus, one sees that for the zero-viscosity limit of the viscous Burgers equation, different 
sheets of the multivalued solution of the formal limiting Cauchy problem (set e = 0) provide 
the strong limit of We{x,t) for different x and t. However, the choice of sheet requires the 
solution of a discrete maximization problem parametrized by x and t, making the limiting 
behavior harder to calculate than the weak zero-dispersion limit of the BO equation. 

1.3.2. The semiclassical limit of the free linear Schrddinger equation. In this problem, one 
considers the equation 

(19) ie^ + 2e^^-^ = 0, x G M, t>0 

ot ox^ 

for small e > 0, subject to initial data of WKB form 

(20) ^/^,(x,0) = A(x)e*^(^)/^ 

with A and 5* real-valued and independent of e. For suitable A and S, the solution to this 
problem can be written as an integral 

(21) ^^(a:,t) = ^^y^e^^«^-'*)/M(Oc^e, t>0, /(e; x, t) := 5(0 + 

The dominant contributions to the solution are calculated via the method of stationary phase 
(see [2^, §5.6), and these come from small neighborhoods of points C, satisfying x, t) = 0, 
that is, solutions ^ of the implicit equation C, = x — 4S'{C,)t. Evaluating the function 2S'{-) 
on both sides of this equation and making the substitution = 2S'{^), one arrives at the 
equivalent form ^ where uo{x) := 2S'{x). Thus, the branches of the multivalued solution of 
Burgers' equation ([s]) with initial condition uq correspond to stationary phase points ^ that 
yield the leading term of the solution ip^{x,t) in the semiclassical limit e 4 0. Unlike in the 
analysis of Laplace-type integrals, where only the critical points corresponding to maxima 
matter in the limit, for oscillatory integrals all stationary phase points contribute to the 
leading-order behavior, and therefore we have an asymptotic representation of ip^{x,t) as a 
sum over branches u^{x, t) of the multivalued solution of Burgers' equation with initial data 

Uq. 

2P(x,t) 

(22) A{x,t)= ^ M„(x,t)e^^'=(^'*'^) + 0(e), t > 0, 

ra=0 

where M„(x,t) are slowly- varying positive amplitudes given by 



(23) M„(x, t) := A{x - 2<(x, t)t) 

and 9k{x, t; e) are rapidly-varying real phases given by 

1 TT / / du^ 

(24) enix,t;e) := -I{x - 2u^{x,t)t; x,t) + - f sgn M - 2t^(x, t) ) - 1 
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ioT n = 0,...,2P{x,t). 

A more explicit connection with the multivalued solution of Burgers' equation may be 
obtained by introducing the quantity 



(25) 



w,{x,t) :=2e^»{log(V'.(x,t))} 



which is the fluid velocity in Madelung's interpretation of the wave function ip^- as describing 



a quantum-corrected fluid motion. Under the condition that the error term in (22) becomes 



0(1) after differentiation with respect to x, some easy calculations show that (22) implies 

' 2P{x,t) 



(26) 



w^{x,t) = 3? < 



n=0 



2P{x,t) 
n=0 



>+0(e). 



It is then easy to see that if P{x,t) = 0, We{x,t) converges strongly pointwise to UQ{x,t), 
the unique solution (for this x and t, anyway) of Burgers' equation. On the other hand, if 
P{x, t) > 0, then there are interference effects among the terms in the sums and these lead 
to rapid oscillations with the effect that WfXx,t) no longer converges in the pointwise sense 
as e J, 0. However, it does converge in the weak topology. The weak limit may be computed 
by multiphase averaging, which we illustrate in the case P{x,t) = 1. The procedure is to 
average the leading term in w^{x, t) over an interval in x centered at the point of interest of 
radius, say, for some p G (0, 1), and then pass to the limit e | 0. This produces the desired 
local average over rapid oscillations of wavelength or period proportional to e. Under an 
ergodic hypothesis that is valid on a set of full measure in the {x, t)-plane, this procedure is 
equivalent to holding and M„ fixed and averaging (with uniform measure) over the torus 
of relative angles (pi := 6i — 6q and (p2 '■= (^2 — Oo- The double integrals can be evaluated 
explicitly, with the result that 



(27) 



w. 



.— lim W( 



i^^'t) = ^Cnix,t)u^{x,t), 



n=0 



where c„(x,t), n = 0,1,2, are nonnegative coefficients with the property that Co{x,t) + 
Ci{x,t) + C2{x,t) = 1. Specifically, the coefficients only depend on x and t through Mo(x,t), 
Mi{x, t), and M2(x, t). If any of these, say M„, exceeds the sum of the other two, then c„ = 1 
and the two other coefficients vanish. Thus the weak limit produces in this case exactly the 
branch u^{x, t) through multiphase averaging On the other hand, if none of the Mn exceeds 
the sum of the other two, then Mq, Mi, and M2 are the side lengths of a triangle, and the 

^ The strict inequality, AIo{x,t) > Mi{x,t) + M2{x,t) or a permutation thereof, defining this situation 
is an open condition on {x,t) G M^, and therefore (depending on initial conditions) there can exist open 
domains in the (a;,i)-plane on which the weak limit of We{x,t) is given by a single branch of the solution 
of the inviscid Burgers equation while w^^Xjt) itself exhibits wild oscillations. Interestingly, this is precisely 
the conjecture made by von Neumann regarding grid-scale oscillations observed in the numerical solution of 
Burgers' equation via a finite-difference scheme (which may be viewed as a dispersive regularization of the 
equation). While many model equations for finite-difference schemes (the KdV equation is one example) do 
not yield such a simple interpretation of the weak limit |19j . it would seem that von Neumann's conjecture 
can hold true if the Schrodinger equation is viewed as a dispersive correction to Burgers' equation. 
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weak limit is a genuine weighted average of the three branches, with weights proportional to 
the opposite angles: 



TT 

(28) ci = — arccos 

TT 
1 

C2 = — arccos 

TT 



1 fM'l + M| - Ml 

Co = — arccos 





2M1M2 




Ml 


+ M|- 






2M0M2 




Ml 


+ M2- 




2MqMi 



The most significant aspect of this analysis is that the weak limit depends on information 
other than just the initial condition uq for Burgers' equation since the functions Mn{x,t) 
involve also the initial wave function amplitude A. This makes the evaluation of the weak 
limit a more complicated procedure than in the case of the BO equation. 

A further connection between the BO equatio n ^ and the linear Schrodinger equation 



(19) in the zero-dispersion limit will be made in ^2.4 



2. Relevant Aspects of the Inverse Scattering Transform for the BO 

Cauchy Problem 

2.1. The Lax pair for the BO equation and its basic properties. The Lax pair [3], 
whose compatibility condition is the BO equation ([T|, consists of the two equations 



(29) 



^e— h A{w^ 

ox 



w 



(30) 



dw 



± 



le- 



dt 



2iXe- 



dw- 



dx 



2iC. 



± 



dx 



w 



± 



where A G C is a spectral parameter, = u^{x,t) is a solution of ([T]), and 



w 



w- 



(x,t; A) 



are functions that are required to be, for each fixed t and A, the boundary values on the 
real x-axis of functions analytic in the upper (+) and lower (— ) half complex x-plane. Also, 
±C± = |l =F ^iH are the orthogonal and complementary (C+ — C_ =1, the Plemelj formula) 
projections from L^(]R) onto its upper and lower Hardy subspaces EI^(]R). From the point of 
view of the inverse-scattering transform, i.e. using the Lax pair as a tool to solve the Cauchy 



problem, equation (29) may be considered for fixed time t and defines the scattering data 
associated with Ue{x,t) at time t. The function w~ may be viewed as a kind of Lagrange 
multiplier present to satisfy the constraint that be an "upper" function. In fact, if 
w"^ G then by applying C+ to (29) and using the projective identities C+[w^ 
C^[w^] = 0, (29) can be written in the form of an eigenvalue problem 



and 



(31) 



w 



G H+(R), 



where C is the nonlocal selfadjoint operator Equation (30) determines the (trivial, as 
we will recall) time dependence of the scattering data. 
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2.2. Scattering data. The theory of the inverse-scattering transform solution of the Cauchy 
problem for the BO equation was first developed by Fokas and Ablowitz [13]. Certain 
analytical details of the theory were clarified by Coifman and Wickerhauser [B], and more 
recently Kaup and Matsuno [T7] found conditions on the scattering data consistent with real- 
valued solutions of Q. As an operator on EI+(M), the essential spectrum of £ is the positive 
real A- axis (for suitable Ue, £ is a relatively compact perturbation of the "free" operator 
corresponding to = 0). For each fixed t and each real A > 0, there exists a unique 



solution = M of (29) with the property that (remarkably, despite the nonlocal nature 
of the problem) it is determined by its asymptotic behavior as x — ?■ — oo on the real line: 
M{x,t] A) = 1 + o(l) as a; — 7- — oo. As the problem is nonlocal, M cannot be characterized 
by a Volterra-type integral equation, but Fokas and Ablowitz |i3] gave a Fredholm-type 
equation whose unique solution is M. The reflection coefficient for the problem is then 
defined for positive real A by the formula [121 |T7| 

(32) /3(A,t) := - / u,{x,t)M{x,t;X)e-'^''^'dx, A > 0. 

For each fixed x and t the function M{x, t; A) can be shown to be the boundary value taken 
on the positive half-line A G M_|_ from the upper half A-plane of a function W{x,t] A) that is 
meromorphic in the complex A-plane with M^. (a branch cut) deleted. Fokas and Ablowitz 
refer to the boundary value taken by W{x,t]X) on the positive half-line from the lower 
half-plane as N{x,t; A). The poles of W{x,t; A) are all on the negative real A-axis (by self- 
adjointness of £) and correspond to the point spectrum of £. It turns out that one of the 



consequences of the Lax pair equation (30) is that the point spectrum is independent of time 
t. In [T^ it is shown that in the generic case when A„ < is a simple pole of W, the first two 
terms in the Laurent expansion of W at X = Xn are both proportional to the same function 
^n{x,t) G EI"'"(]R2;), which is an eigenfunction of C with eigenvalue A = A„. The ratio of 
these two terms is in fact linear in x: 

(33) W{x, t; A) = -ze ^"^^' + (x + a„(t))'l'„.(x, t) + 0{X - A„.), A ^ A„. 

A — A„ 

Kaup and Matsuno [IT] showed that for real u, the complex- valued phase shift a„(it) may 
be written in the form 

(34) an{t) = 7n(t) - 7^, Init) G M. 

The set of scattering data corresponding to the potential u^{-,t) then consists of: 

• The reflection coefficient /3(A,t) for A > 0. We write /3(A) for /3(A,0). 

• The negative discrete eigenvalues {Xn}n=i, Xi < X2 < ■ ■ ■ < Xn < 0. 

• The real phase constants {■yn(t)}n=i- We write 7„ for 7n(0). 

2.3. Time dependence of the scattering data and the inverse scattering transform. 

As time varies, one may expect the scattering data to vary, but the time dependence as 



implied by (30) turns out to be very simple. As pointed out above, the discrete eigenvalues 
{Xn}n=i constants of the motion, and Fokas and Ablowitz [13\ showed that 

(35) /3(A,t) = /3(A)e*^'*/^ A > 0, 

12 



and 



(36) 7„(t) = 7„ + 2A„t, n = l,...,N. 

The inverse-scattering procedure for solving the Cauchy problem for the BO equation with 
suitable real initial data uo{x) is then to calculate the scattering data at time t = from mq, 



evolve the scattering data forward in time t by the explicit formulae (35) and (36), and then 
solve the inverse problem of constructing u^{-,t) from the scattering data at time t. Generally, 
this requires solving a scalar Riemann-Hilbert problem for W{x, t; A) in the complex A-plane. 
This Riemann-Hilbert problem is quite interesting as it involves a jump condition across the 
continuous spectrum A > in which the boundary value from above, W = M{x,t;X), 
is proportional to an integral from A' = to A' = A of the boundary value from below, 
W = N{x, t] A'). Thus the jump condition is nonlocal, a fact that makes the inverse problem 



almost completely analogous to the direct problem (29) which, after integration becomes a 
nonlocal Riemann-Hilbert problem of exactly the same type in the complex x-plane. This 
fact should perhaps be contrasted with the situation for the KdV equation where the direct 
and inverse problems are of quite different natures. This remarkable symmetry between the 
forward and inverse problems for the BO equation is a theme that will be touched upon 
again in this paper in some detail. 

2.4. The refiectionless inverse scattering transform. If /3(A) = {i.e. the problem is 
reflectionless) , then the boundary values taken by W{x,t; A) on the positive half-line agree, 
so W{x,t]X) is a meromorphic function on the whole complex A-plane with simple poles 



at the negative real eigenvalues. The condition (33) and the normalization condition that 
W{x,t; A) — )■ 1 as A — > oo then provides sufficient information to reconstruct W{x,t; A) from 
the discrete data {Xn}n=i and {■yn}n=i- Via a partial-fractions ansatz for W{x,t;\), this 
amounts to a solving a linear algebra problem in dimension N. Once W is determined in 
this way, one obtains C^[u^{-,t)] by the formula 

(37) C4u,{-,t)]{x) = \im X{l-W{x,t;X)). 

X—>oo 

Since is real, one then has 

(38) u,{x,t) = 2^{C+[u,{;t)]{x)}. 

This procedure clearly leads to a determinantal formula for u^{x, t) in the reflectionless case. 
It turns out to be the same (multisoliton) formula that Matsuno [21] had obtained, before the 
relevant inverse-scattering transform was discovered, by applying Hirota's bilinear method 
to the BO equation: 

(39) n,(a;,t) = 2e^3{log(r,(a;,t))}, 
with the "tau-function" 

(40) r,(x, t) := det(I + ie~^A,), 

where = A^{x,t) is an N x N Hermitean matrix with constant off-diagonal elements 

(41) Anm ■= ^r^")^"^" > n^m, 
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and diagonal elements depending explicitly x and t\ 



(42) 



-2A„(x + 2A„t + 7„) 



In (41) we mean the positive square root of the positive product A„Am. For the purposes of 



this paper, we will only require this reflectionless version of the inverse-scattering transform. 

In his paper [21], Matsuno noted that regardless of the value of iV, the complex determinant 
Te{x,t) satisfies the real equation (Hirota bilinear form of the BO equation) 



(43) 



ie— + 2e'^ — 
dt dx"^ 



t: + -le 



~dt 



dx"^ 



fdr,^^ ^ dr* 
dx \ dx " dx 



The terms on the left-hand side should be compared with the linear Schrodinger equation 
(19). If one makes a formal WKB ansatz of the form Te{x,t) = A{x,t)e^^^^'^'^^^, then the 



terms on the right-hand side of (43) are formally small compared with those on the left- 



hand side, and to leading order in e (43) simply reduces to the inviscid Burgers equation 
with = 2dS/dx (as is consistent with (39)). 



2.5. Conservation laws and trace formulae. As with the KdV equation, the time evolu- 
tion of BO equation preserves an infinite number of functionals of u. These were first found 
by Nakamura [27]. The equivalent representation of these functionals in terms of the time- 
independent portion of the scattering data, i.e. the eigenvalues {\n}n=i and the modulus of 
the reflection coefficient |/9(A)p for A > 0, was obtained by Kaup and Matsuno [17]. These 
identities amount to a hierarchy of trace formulae for the operator C 

The conservation laws take the form dl^/dt = 0, A; = 1, 2, 3, . . . . The integrals Ik may be 
generated by the following recursive procedure: first set pi := 1 and then define 

(44) pk+i{x,t):=C4u,{-,t)pk{-,t)]{x)+ie^{x,t), fc = 1,2,3,.... 
Then, the integrals of motion are 

(45) h{t) := I u^{x,t)pk{x,t)dx, /c = 1, 2, 3, . . . . 



The equivalent spectral representation given in [T7] is 

^ ' -\k^ r+oo 



(46) 4(t) = 27reV(-A„)'-' + ^r^ / \m\'>^'''dX, A: = 1, 2, 3, . . . . 

n=l 

In view of the results presented in ^2^, the latter representation makes clear the fact that 
dik/dt = 0. 

The first two conserved quantities are quite simple and in fact they are the only ones in 
the hierarchy having local densities: 

(47) h '■= fue{x,t)dx and /2 := - f Ue{x,t)'^dx. 
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3. The Scattering Data in the Zero-Dispersion Limit 



In this section we consider the following problem. Given a suitable function Uq{x) rep- 
resenting the initial condition for the BO equation, we wish to determine an asymptotic 
approximation, valid when e > is small, to the scattering data {f3{X), {Xn}n=i, {ln}n=i} 
corresponding to uq. Even though uo is held fixed as e tends to zero, the scattering data will 



depend on e as this parameter appears in the equation (29). As the operator C is nonlocal, 
we cannot rely on the WKB method as is so useful for analysis of differential operators (for 
example, the analysis of Lax and Levermore [2^ was based on the WKB analysis of the 
Schrodinger operator that arises in the scattering theory for the KdV equation). 

3.1. Admissible initial conditions. The type of initial data for the BO equation ([T]) that 
we will consider for the rest of this paper is the following. Many of these conditions are 
imposed for our convenience; we make no claim that they are necessary. 

Definition 3.1. A function : M — t- M will be called an admissible initial condition if it 
has the following properties: 
Smoothness: uq G ^(^^(IR). 
Positivity: Uq{x) > for all x G M. 

Existence of a Unique Critical Point: There is a unique point Xq G M for which 

Uq{xo) = and 

(48) m'o'(xo) < 0, 

making xq the global maximizer of uq- 
Tail Behavior: limx^±oo Uo{x) = 0, and 

(49) lim \x\'^~^'^u'q{x) = C± for some q > I, 

where C+ < and C_ > are constants. These two conditions together imply that an 
admissible initial condition uq also satisfies 

(50) lim \x\'^Uo{x) = =F 



q 



Inflection Points: In each bounded interval there exist at most finitely many points x = C, 
at which Uq{^) = 0, and each is a simple inflection point: u'q{^) ^ 0. 

Corresponding to an admissible initial condition mq we define a positive constant L by 

(51) L := maxMo(x), 

and we let the mass M be defined by 

(52) M := / uo{x) dx. 



Note that the mass is guaranteed to be finite according to (50) since Uq is bounded. Also, if 
Uq is an admissible initial condition, we can define turning points x± : [—L, 0) — )■ M as two 
monotone branches of the inverse function of uo'- uo{x±{X)) = —X and x^{X) < xo < x^{X) 
for -L < A < 0. See Figure |4| 
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Figure 4. The graph of an admissible initial condition and the turning points x±{X). 



3.2. Matsuno's method. In two papers [221 123] . Matsuno proposed a remarkable method 
to approximate, in the hmit e J, 0, the time-independent components of the scattering data 
for suitable uq. His method was based on the conservation laws for the quantities (45). With 
the use of the more recently obtained trace formulae equating Ik as given by (45) with the 



equivalent formulae (46) [TT], several heuristic aspects of the original method given in 
can be placed on more rigorous footing. 

The first key observation made in 123] is that if Uq is a smooth function independent 
of e, then by evaluating the integrals Ik{t) at time t = 0, one sees that they have limiting 



values as e J, 0. These limits may be obtained simply by solving the recurrence relation (44) 
with e = 0: 



(53) 



lim/jt 

40 



uo{x)C+[uoC+[uoC+[- ■ ■ UoC+[uo] 



(x) dx, k = 1,2, 3, 



where the Cauchy projector C+ occurs k — 1 times in the integrand. With the use of an 
identity valid for reasonable complex- valued functions uq{-) and suggested by comparing the 



conserved quantities generated from the Kaup-Matsuno iteration scheme (44) with those 



generated via the older scheme of Nakamura [27], one sees that the right-hand side of (53) 
can be equivalently written in the simple form 



(54) 



lim/fc 

£4-0 



Uo[X) 



doc . k — X , 2 . 3 , 



On the basis of heuristic physical arguments, in [22l 123] Matsuno supposed that for smooth 
positive initial data uq, all moments of the reflection coefficient remain bounded as e J, 0. 



Adopting this hypothesis, a comparison of (54) with (46) then shows that 
(55) 



N 



limey^(-A„^ 

n=\ 



k-1 



27ik 



uo{x) dx, k = 1,2, 3, 
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In particular, taking k = 1 one obtains 



(56) 



lim eA^ = M, 

£4-0 



where the mass M is defined by (52), so the number of eigenvalues is asymptotically pro- 
portional to 1/e. 

These calculations suggest that the normalized counting measure of eigenvalues may have 
a limit in a certain sense as e J, 0, perhaps as an absolutely continuous measure with density 
F{X). Matsuno calculated this density by replacing the left-hand side of (55) with an integral 
against the unknown density F{X): 



(57) 



i-xy-^Fix) dx 



1 



2nk 



uq{x) dx, A; = 1, 2, 3, ... . 



The problem that remains is then the classical one of constructing the density F{X) from its 
moments, which are known if the initial condition uq is given. 

Matsuno showed that, remarkably, this moment problem can be solved explicitly. He 
introduced the characteristic function (Fourier transform) of F: 



(5^ 



F(0:= r F{X)e-'^'dX, 

J — oo 



in terms of which, the moment relations (57) become 



(59) 



2TTk 



uo{x) dx, k = 1,2, 3, 



Recalling the constants L and M defined by (51) and (52) respectively, it is easy to obtain 
the estimate 



(60) 



1 



ik-l)\d^ 



r(o) 



< 



k-l 



k\ 



from which it follows that F{^) is an entire function and hence is equal to its Taylor series 
about ^ = 0: 



m = E 



1 



d 



k-l 



^^{k-iy.de 



k-l 



(61) 



k=l 



k-l 



2Txk\ 



k=l 



uq{x)^ dx 
iiuQ{x)f 



k\ 



dx. 



The combined sum and integral is absolutely convergent for all ^ G C, so the order of 
operations may be reversed: 



(62) 



m 



2mi 



,k=l 



k\ 



dx 
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3'«"o(")/2gjn(l^Mo(a;)) dx. 



By Fourier inversion, 

1 r+R r+R r giC(A+no(x)/2) 

(63) F(A) = :^ Hm / F(e)e'«^ = lim / / sin {l^Uo{x)) dx d^ 

Applying Fubini's Theorem to reverse the order of integration and then passing to the hmit 
R t C)0, the integral over ^ can be evaluated as the indicator function of an interval: 

(64) F(A) = ^^Xh«oW,o](A)rfx. 

This formula shows that F{X) = for A>OorA<— L. By a "layer-cake" argument we 
may simplify this formula for A G (— L, 0) as 

(65) F(A) = ^ I dx, -L<\<0. 

271" J{xm, no(a:)>-A} 

This is Matsuno's remarkable result. We have presented Matsuno's method in some detail 
because it turns out that a key calculation in our analysis of the inverse problem in the 
zero-dispersion limit reduces to almost the same steps, as we will see shortly. This is worth 
emphasizing because it provides further evidence that for the BO equation, scattering and 
inverse-scattering are mathematically very similar operations. 



Matsuno's formula (65) could perhaps be compared with the Bohr-Sommerfeld formula 
that gives the density of eigenvalues of the Schrodinger operator in the zero-dispersion theory 
of the KdV equation [20]; aside from a constant factor the Bohr-Sommerfeld formula replaces 



the unit integrand in (65) with the positive square root ^ uo{x) + A. 

While quite severe hypotheses on uq are required for all of the arguments to go through. 



the formula (65) makes sense under much weaker conditions. In particular, we may interpret 



(65) for an admissible initial condition, in which case we may express F{X) directly in terms 



of the turning points x±{X): 

(66) F(A) := -^(x+(A) -x_(A)), -L < A < 0. 

2tt 



We take (66) as a definition valid for admissible initial conditions uq. Note that 



fO 

(67) / F(A) d\ = M, 



where the mass M is defined by (52). 



3.3. Formula for phase constants. The WKB methods recalled by Lax and Levermore 
[20] to analyze the Schrodinger equation in the forward problem for the zero-dispersion limit 
of the KdV equation were sufficiently powerful to provide asymptotic formulae for both the 
discrete spectrum (the Bohr-Sommerfeld formula that is the analogue in the KdV theory of 
the function F{X) obtained by Matsuno) and also for the "norming constants" that in the 
KdV theory are the analogues of the phase constants {7n}^=i in the BO theory. However, we 
have not found a way to apply these methods to the nonlocal operator C, and unfortunately 
Matsuno's method does not provide approximations of the phase constants {■jn}n=i since 
they do not enter into the trace formulae. 

Our contribution to the theory of the spectral analysis of the nonlocal operator C in the 
zero-dispersion limit is to provide a new asymptotic formula for the phase constants. It is 
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difficult to motivate the formula as it arises from the analysis of the inverse problem that 
we will describe in the next section, but it is nonetheless quite easy to present. If A < 
is an eigenvalue of L with potential u given by an admissible initial condition mq, then our 
approximation to the corresponding phase constant is given in terms of the turning points 
x±{X) as follows: 

(68) 7 ~ 7(A) := -\{x^{X) + a;_(A)), -L < A; < 0. 

Remark 3.1. Our choice of 7(A) in terms of uq is specifically designed to ensure the con- 



vergence of Ue{x,t) (to be defined precisely in Definition 3.2 below) at t = to the given 
e-independent initial condition uq. 

3.4. Modification of the Cauchy data. Based on the above considerations, we may 
now make very precise definitions of formal (not rigorously justified) approximations of the 
scattering data corresponding to an admissible condition uq. The first approximation is to 
neglect the reflection coefficient by setting 

(69) /3(A) := 0, A > 0. 

Next we define the exact number of approximate eigenvalues (hopefully also the approximate 
number of exact eigenvalues) by setting 

M 

(70) N{e) := 

which in particular implies that 

(71) limeiV(e) = M. 

Then we define approximations to the eigenvalues themselves as an ordered set of numbers 
{Xn}n=i C (— -L,0) obtained by quantizing the Matsuno eigenvalue density given by (66): 



(72) / F{\)d\ = e[n-l] , n = 1, 2, ■ ■ ■ iV(e). 



Finally, we define approximations to the corresponding phase constants as numbers {'jn}n=i 
given precisely by 

(73) 7n:=7(A„), n = l,...,iV(e). 



where 7(-) is defined by (68). 



Now in our analysis of the Cauchy problem for the BO equation with admissible initial 
data Uq we take a sideways step that is not a priori justified: we simply replace the true 
solution u^{x,t) of the Cauchy problem with a family u^{x,t) of exact solutions of the BO 
equation ([T]) with the property that for each e > the scattering data for u^{x,t) at time 
t = is exactly the approximate scattering data just defined. This step was also an important 
part of the method of Lax and Levermore [20J. We formalize this modification of the initial 
data in the following definition. 
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Definition 3.2. Let uq be an admissible initial condition. Then, by Ue{x,t) we mean the 

exact solution of the BO equation ([T]) given for each e > by the reflectionless inverse- 
scattering formula 

d 



u,{x,t) :=2e— ^J{log(f,(x,t))}, 



(74) 
where 

(75) f,(x, t) := det (l + ie'^A, 

and where = A^{x,t) is an A^(e) x A^(e) Hermitean matrix with elements 



(76) 

and 
(77) 



Ar. 



-2A„(x + 2A„t + 7„ 



-, n ^ m 



-2A„(x + 2A„ + 7(A„)). 



Here the number N{e) is defined by (70) and the components of the scattering data {\n}n=i 
and {'^n}n=i given explicitly by (72) and (73) respectively. 



While it is not the case that £te(x,0) = uq{x) in general, the relevance of this defini- 
tion in connection with the Cauchy problem with initial condition uq is a consequence of 
Corollary |1.1| which guarantees convergence in the mean square sense of Me(-,0) to mo(") as 

e;o. 



The proof of Theorem 1.1 will be given below in q4l Before embarking on that we note 



that Definition 3.1 implies a number of properties of the functions F and 7 that will be useful 
later, so we take the opportunity to record these here. Note that F and 7 will frequently 
occur in the context of the following functions: 

(78) L>(A;x,t) := -2A(x + 2At + 7(A)), -L < A < 0, 
and 

(79) (^(A) := v/-AF(A), -L < A < 0. 

Lemma 3.1. Let uq be an admissible initial condition with decay exponent q > 1, and let 



F : [-L,0) be defined by (66) and 7 : 0) ^ R be defined by (68). Then F and 



7 both belong to C^^\—L,0) and F and F' are strictly positive on this open interval. Also, 
there exists a sufficiently small constant S > and positive constants C_l and Cq such that 

(80) Ic^lVlTx < F{\) < c^lVlTx, 



and 
(81) 



4:^/L + X 

both hold for —L < A < —L + 6, while 

1 



2 v/TTA 



(82) 



Coi-X)-'/' < F{X) < Co(-A)-i/^ 
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and 
(83) 



2 q ^ 

both hold for —6 < A < 0. ^4/50, 



(84) 



|7(A) +a;o| ^ 7rF(A) and 



|7'(A)| < 7rF'(A), -L<A<0, 

inequalities that when combined with (80)-([83| imply obvious upper bounds for |7(A) +Xo| 
and |y(A)|. 

In particular, these estimates show that F{X) is integrable, and (f{X) and D{X;x,t) (and 
hence also Xx±{X)) are bounded, and that with o = min(^, 1 — -) G (0,1), ip{-) is Holder 
continuous with exponent a/2 while D{-;x,t) is Holder continuous with exponent a uniformly 
for {x,t) in compact sets, on (— L,0). 

Proof. The turning points a;-|-(A) are clearly of class C^^\—L,0), by definition x+(A) > 
Xq > x_(A) on this open interval, and moreover x+(A) is strictly increasing while a;_(A) is 
strictly decreasing on (— L,0). These facts immediately imply the desired basic smoothness 
properties of F and 7, and the positivity and monotonicity of F, as well as the inequalities 

m. 



Since Uo{xo) = L and Uq{xo) = 0, the C(^)(]R) function uq satisfies 



^5) 



lim 



uo{x) - L Uq{xo) 



and lim 



m'o(x) 



Uq{xo). 



x^xo [X — Xq) I x^xf, X — Xq 

Using these together with the inequality Uq{xq) < 0, the definition of x±{X) as branches of 
the inverse function of uo shows that 



(86) lim 

H-L 



±{X±{X) - Xq) 

VlTx 



-m'o'(xo) 



and lim ±x\(X)VL + A 

Xi-L 



-2<(xo)' 



Using these relations in (66) and (68) establishes the existence of the limits 
F(A) 



^7) 



lim , 

H-L y/L + X 




and lim F'(X)VL + X = — 
H-L 27r 



-u'^{xo) ' 



-u'^ixo) 

which prove the two-sided estimates ( [80|) a nd (81). 

Next, note that the decay conditions (49) and (50) for uq and its derivative together imply 
that 



limx±(A)(-A)i = ± ( T 

AtO 



and lima;'_|_(— A) 'J^"'^ = ± 



Ato 



where =fC± are the positive constants in (|49|) and (|50|). It follows from (|66|) that 



^9) 



limF(A)(-A); 

Ato 



1 

2^ 



1 



q 



which proves ( 82 ) and 
(90) 



limF'(A)(-A); 

Ato 



27rg 



.9± 
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which proves (83). 



□ 



4. The Inverse-Scattering Problem in the Zero-Dispersion Limit 



In this section, we provide the proof of Theorem 1.1 



4.1. Basic strategy. Outline of proof. According to Definition 3.2 u^(x,t) is expressed 
in terms of the determinant as follows: 

dU 

(91) u,{x, t) = —^{x, t), U,{x, t) = 2e53{log(f,(x, t))}. 

ox 

As the logarithm of a complex- valued quantity is involved, Ue{x,t) is only defined modulo 
4:716 for each (x, t), and naturally one should choose the appropriate branch for each (x, t) to 



achieve continuity. We do this concretely in equation (93) below. 

At this very early point our analysis must take a very different path than that followed 
by Lax and Levermore [20j in their study of the zero-dispersion limit for the KdV equation. 
Indeed, the expansion of in principal minors that is at the heart of the Lax-Levermore 
method would be a poor choice in this situation. One reason for this is simply that the 
principal- minors expansion of f^{x,t) consists of complex-valued terms of indefinite phase, 
so the sum cannot be easily estimated by its largest term. But a more important reason is 



that the formula (91 ) for U^{x, t) involves not log(fe) but rather 53{log(rJ}, that is, we require 
an estimate of the phase of the determinant and we are not interested in its magnitude. 

So instead of expanding the determinant as a sum, we write it as a product. Let {a„}^if 
be the real eigenvalues of Ae(x, t). Then the corresponding eigenvalues of 1 + ze^^Ae(a;, t) are 
of course {1 + ie~^a„}^if , so we may expand as a product over eigenvalues in the form: 

N{,e) 

(92) f,(x,t) = J] (l + ze-V). 

n=l 

This yields a suggestive formula for Ue{x,t) in terms of the eigenvalues of A^: 

N{e) 

(93) [/.(x, t) := e ^ 2 arctan (e"^a„) . 

n=l 

Here —tt/2 < arctan(-) < 7r/2, so in particular by this definition we have made an unambigu- 
ous choice of the branch of the logarithm. This formula seems at first not to be of much use 
because, unlike the principal minor determinants in the Lax-Levermore method which can be 
written explicitly in terms of the matrix elements, the eigenvalues of A^ are only implicitly 
known. However, numerical experiments suggest that some structure emerges in the limit 
e J, 0. Indeed, the plots shown in Figure [s] provide good evidence that the normalized (to 
mass M) counting measures /i^ given for e > by 

M ""-"^ 

(94) /i, := ^^^5^^°"' eigenvalues of A, 

might converge in some sense to a measure /i having a density G{a;x,t). This convergence 



suggests further that the formula (93 ) could be interpreted as a Riemann sum, for the integral 
of 7rsgn(a) (the pointwise limit as e J, of the summand) against the limiting measure /i. 
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Figure 5. Histograms of eigenvalues of corresponding to the initial con- 
dition uo{x) := 2(1 + X = 5, and t = 2, normalized to have total area 
M = 1, compared with the density G{a;x,t) of the limiting absolutely contin- 
uous measure /i. 



We will prove that indeed U^{x,t) converges, uniformly with respect to x and t in compact 
sets, to a limit function U{x, t) given by such an integral in the limit e J, 0. 

To obtain an effective formula for U {x, t) we need to analyze the asymptotic behavior of 
the measures /i^. This part of our analysis is modeled after the work of Wigner [321 E3] on the 
statistical distribution of eigenvalues of random Hermitian matrices with independent and 
identically distributed matrix elements. Like Wigner, we use the method of moments because 
while the measures themselves are not easy to express in terms of the matrix elements, their 
moments are: 

r M ^^"^ M 

(95) y^c.Pc?/i,(«) = ^5^«j:=^tr(An, p = 0,1,2,.... 



We prove the existence of the limit of the right-hand side in equation (95) as e J, for every 
p using the fact that for small e the matrix concentrates near the diagonal, where it can 
be approximated by the product of a diagonal matrix and the Toeplitz matrix corresponding 
to the symbol f{9) := i^n — O), < 6* < 27r (of singular Fisher-Hartwig type due to jump dis- 
continuities). The result of this asymptotic analysis of moments is the following Proposition, 
the proof of which will be given below in §4.2 



Proposition 4.1. For each nonnegative integer p, 



(96) lim / a''' djj,^{a) = Qp, 
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with the limit being uniform with respect to {x, t) in any compact set, where 



(97) Qp :-- 



— ^— r [(a; + 2At-x_(A)f+'-(x + 2At-a;+(A)f+n {-2Xr dX. 
2tt{p+1) J_l'- 



Given these limiting moments, the next task is to estabhsh the existence of a corresponding 
hmiting measure fi with these moments, and to prove the existence of the hmit Ue{x,t) — > 
U{x,t). A remarkable feature of this analysis is that the solution of the moment problem 
for /i is carried out by virtually the same procedure as Matsuno used to obtain the function 



F(A) from uq (see ^3.2). Our result is the following Proposition, that will be proved in all 



details in ^4.3 



Proposition 4.2. Uniformly for (x,t) in compact sets, 

(98) limU,{x,t) = U{x,t), 

where 

(99) U{x,t) := / TT sgn{a) dfi{a) 



and where /x is an absolutely continuous measure of mass M with density G{a;x,t), and 

1 f dX 

/ X[-2X{x+2\t-x+{X))-2X{x+2Xt-x^(X))][Oi)- 



(100) 



G{a; X, t) :- 



X 



Here, X[a,b]{,z) denotes the indicator function of the interval [a, 6]. 



The limiting measure is the closest analogue in the zero-dispersion theory of the BO 
equation of the equilibrium (or extremal) measure arising in the Lax-Levermore theory of 
the KdV equation. But a significant difference is that in this case the measure /i is specified 
explicitly rather than implicitly as the solution of a variational problem. 



The region of integration in the double integral obtained by combining (100) with (99) 



is illustrated for three different values of (x, t) in Figure rol The points where the boundary 




-1.5 -1 -0.5 
A 




Figure 6. The region of integration — 2A(x + 2At — a;+(A)) < a < —2X{x + 
2Xt - a;_(A)) for uq{x) = 2(1 + x^)-^ with t = 0.7. Left: x = 2 (to the left 
of the oscillatory region for Ue{x,t)). Center: x = 2.5 (within the oscillatory 
region for Ue{x,t)). Right: x = 3 (to the right of the oscillatory region for 
u^{x,t)). The line a = of discontinuity of the integrand is superimposed, 
and the intersections of the boundary with this line are indicated with arrows. 



curves of this region intersect the line a = (where the integrand is discontinuous) obviously 
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will play an important role in the differentiation of U{x,t) with respect to x. Moreover, 
these intersection points correspond (simply by changing the sign) to the branches of the 
multivalued solution of Burgers' equation with initial data Uq. This explains their appearance 
in the formula for the weak limit of u^{x,t). All details of this calculation will be given in 
^4.4, which will complete the proof of Theorem 



4.2. Asymptotics of traces of powers of A^. Proof of Proposition 4.1 The definition 
(72) imphes that where F{X) is bounded and bounded away from zero, the numbers {Xn}n=i 
are locally nearly equally spaced, but they are more dilute near the "soft edge" of the 
spectrum A = —L and more dense near the "hard edge" of the spectrum A = 0. Taking into 
account the soft edge behavior we may obtain a uniform estimate: 

Lemma 4.1. There is a constant Cx> independent of e such that 

(101) |A„ - A™| < CAe^^V-^r^^ 

holds for all n and m between 1 and N{e). 

Proof. Since F is a monotone increasing function with F{—L) = 0, it is bounded away from 
zero except in a right-neighborhood of A = —L. Using the lower bound given in (80) from 
Lemma 



3.1 



we obtain a lower bound F{X) > C\/L + \ valid uniformly for —L < A < with 
< C < C-l/2. Then, using the definition (72) we have (assuming n > m without loss of 
generality) 

/"An /"An /"|A„ — Ami O/^ 

(102) e|n-m| = / F{X)dX>C v^LTXdX>C = ^\K-~Xmf^\ 

so the desired inequality follows with Cx '■= (2C/3)^^/'^. □ 
We decompose the matrix A^ into a sum A^ = D + H of its diagonal part 

(103) D :=diag(Di,D2,...,^7V(e)), := DCXk;x,t), 

where D{X]x,t) is defined by (78), and its off-diagonal part H whose matrix elements are 
given by 

(104) Hnm = for n ^ m, and if„„ = 0. 

We also will soon need the quantities {(pn}n=i defined by 

(105) ifn:=^{K), n = l,...,N{e), 



where ^{X) is given by (79). 

Lemma 4.2. There is a constant > and for each R > there is a constant Cd,r > 
such that 

(106) \^n\ < 

and 

(107) sup < Cd,r 
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both hold for all e > and all n between 1 and N{e). Also, 



(108) 

and 

(109) 



m 



a/3 



sup 



\D„. - 



both hold for all e > and for all n and m between 1 and N{e). Here a is the positive Holder 



exponent of Lemma 3. 1 



Proof. Thi s is a n easy consequence of the Holder continuity of </?(■) and D {-\ x , t) guaranteed 
by Lemma 3.1, and of the spacing estimate for {Afc}^^"* given in Lemma 4.1 In fact, since 
D is Holder continuous with exponent cr while if has exponent a/2 the most natural bound 
for \Dn — -Dm I is proportional to e'^'^^^\n — mp'^/^, and to obtain (109) we use the fact that 
e\n — m\ < 2eN{e) is uniformly bounded to reduce the exponent to a/3. □ 

Lemma 4.3. There is a constant Ch > such that 

(110) \in-m)Hn^\<CH 

and 



(111) 



m 



a/3 



both hold for all e > and all n ^ m between 1 and N{e). Again, a > is the Holder 



exponent of Lemma 3.1 



Proof. Suppose without loss of generality that n > m, implying that Am < A„ < 0. Then 



-i(n — m)Hr, 



(112) 



~ e{n - m) 

"^n V ~^m^ ~ 

< ii-U + [-Am]) I ' 

~ e(n — m) 
= ein — mi — 2A„ — 



An Am 

Now, recalling the definition (72) of the numbers {Xk}k=i cind applying the Mean Value 



Theorem we may write the latter difference quotient as F(^) for some ^ with Am < ^ < A„, 
and since F is increasing we have F{C,) < F(A„), so 

(113) - i{n - m)Hnrn < 2e(n - m) - 2A„F(A„) = 2e{n - m) + 2ipl, 

where we have also replaced e{n — m) with 2e{n — m). On the other hand, we may write 

(114) — i[n — m)Hnm = 2e{n — m) 



-Xr 



"Am ~l~ Am 



~ e(n — m) 

2Xr ^ ' 



Xn Am Xn Aj- 

Again the difference quotient may be replaced by F{^) > F{Xm), and since 
(115) V^V^ 



An Am 



-A. 



An + V A^ 



> -1, 
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we obtain 

(116) - i{n - m)Hnm > -2e(n - m) - 2XmF{Xm) = -2e{n - m) + 2^9^ 

Combining (113) and (116) gives 
(117) 



[n 



m)Hnm - 2iLpn^m\ < 2e|n - m\ + 2max{ipn,^m}\^r 



and then applying Lemma 4.2 we obtain 



:ii8) 



\{n - m)Hnm - 22V5„V5m| < 2e|n - m| + 2Cle'^^^\ 



n 



m 



a/3 



Now, < e\n — m\ < 2eN{e), and this upper bound has a hmit as e ^ 0, so e\n — m\ is 
nonnegative and bounded. Since a < 3 we have therefore proved (111). Since ^n^m and 
e\n — m| are bounded, (110) then follows from (111). □ 

For any nonnegative integer power p, the pth moment of the measure /i^ can be written 
in terms of D and H with the use of (95): 

(119) 



j=0 

where Zpj contains the contribution to the trace coming from products of matrices involving 
exactly j factors of H: 

(120) Zp^ := J2 (D^^H'^^ ■ ■ ■ D'^'^H'^^) , 

hi+h2A \-hs=j 

and where di > and hg > 0, while dk > for 2 < k < s and /i^t > for 1 < A; < s — 1. 
Since p is a fixed number, it will suffice to compute the limit of Zpj as e 4 for j = 0, . . . 
Actually, it will be enough to consider even values of j as the following result shows. 

Lemma 4.4. If j is an odd number, then Zpj = 0. 

Proof. Since tr(M) = tr(M"^) for all square matrices A, 
N{e) 



M 



(121) 



di+d2-\ \-ds=p-j 

hi+h2-\ hhs=j 



where in the second line we have used the facts that 
the terms in the sum we therefore obtain 



J2 tr (^(D'^iH'^i---D^^H'^^)^' 
ly J2 tr (H'^^D'^^ ■■■H'^iD'^i) 

D and H""" = — H. By relabeling 



di+d2-\ hrfs =p-j 

hi+h2A \-hs=j 



:i22) 



M 



□ 



Since N{e) > and M < oo, the desired result follows. 

An important role will be played below by the Toeplitz (discrete convolution) operator 
Tf : iHZ) ^e{Z) defined by 

(7/c)^ := ^ fn-mCm, {Cm}meZ € ^^(Z), 



(123) 
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where {fn}nez € ^^(Z) is the sequence 
(124) /„ := 



n 
0, 



n ^ 
n = 0. 



Lemma 4.5. For any even positive integer j, we have 



:i25) 



E /- 



n2,...,nj 



.1=2 



in 



J + 1 



where the j — 1-fold infinite sum converges absolutely. 

Proof. Note that since {/„}„6Z e {gn}n(^z e ^^(Z) as well, where gn := |/n| for all 

n E Tj. The corresponding Fourier series converge in the mean-square sense to functions /(■) 
and g{-) in L^[0,27t]: 

And 



(126) 

and 
(127) 



f{e):=Y^fne^ 



Q <d <2ti 



nei 



gne 



'InO 



log(2(l -cos(^))), O<0<27r. 



First we establish the absolute convergence of the series on the left-hand side of (125). 



Using (123), observe that 
(128) 



-rit+i I 

.1=2 



where Tg is the Toeplitz operator associated with the sequence {gn\n&- Now, g{-) has 
a logarithmic singularity at ^ = (mod 27r), but this is sufficiently mild that g{-)™' G 
L^[0, 27r] C -^v^[0, 27r] for any positive integer power m. Now for any function k[-) G iv^[0, 27r], 
the corresponding Fourier coefficients are 

'•271 



(129) 



1 

2^ 



so in particular we see that {T^ ^g)^ is the average value of the function whose Fourier 
coefficients are {{T^~^g)n}n&- But by the convolution theorem: 



(130) 

so it follows that 
(131) 



Ufi—m'^m 



w{9) = u{9)v{e), 



2lT 



g{eyde 



1 

'0 2^ 

which is finite because (/(■)■' G L^[0,27r]. 

Now we find the exact value of the j — 1-fold infinite sum by the same reasoning: 



(132) 



"2 



n2,...,nj 



fni-ni+i 

.1=2 



2ti 



2tt 



fioyde, 
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and by direct calculation using (126), 



(133) 



27r 



2-K 



fioyde 



2tt 



2lT 



Vde 



{my 



for j even (the integral vanishes by symmetry for j odd). 



□ 



Now we consider separately each of the terms in Zpj for j even. 
Lemma 4.6. If j is an even number and hi + ■ ■ ■ + hg = j while di + ■ ■ ■ + dg = p — j , then 



(134) lim ^tr (D'^^H'^^ ■ ■ ■ D^^H'^^) = j'' D{X; x, tf-' ^{\f' F{\) dX, 



with the limit being uniform with respect to {x, t) in any compact set. 

Proof. Recalling the matrix elements -D„ and Hnm of D and H respectively, we have 



N(e) 

(135) tr (D°'^H'^i---D'^=H^^) = 

ai^a2,...f(ij=l 



n 

i=l 



n 

.£=1 



H, 



Hajai 1 



where the exponents mi, . . . , m,- are given by 



(136) 



di, i = 1 

rrii := < (ib+i, i = 1 + hi + h2 + ■ ■ ■ + hi, for some < 6 < s 
0, otherwise. 



Note that mi + m2 + ■ ■ ■ + mj = di + d2 + ■ ■ ■ dg = p — j . 

Now, the matrix element Hnm is relatively small unless n ^ m, and this suggests that the 



j-fold sum in (135) should concentrate near the diagonal, where = ai for all k. Making 



this precise, given any r > we will first show that 



(137) 
where 



limZoD(e) = 0, 

e4,0 



(138) 



^OD(e) := 



M 



N{e) 

E 

ai,a2,...,aj=l 
3fc:|as. — ai|>e^' 



i=l 



J-1 
.i=l 



H, 
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with the hmit being uniform for {x,t) in compact sets. Indeed, if + < i?^, then using 
(107) from Lemma 4.2 and (110) from Lemma 4.3 we obtain 



ZoD{e)\ < 



iV(e) 



N{e) 

E 

ai,a2,.--)Ctj=l 
3fc:|afc-ai|>e-' 



■j-l 

—o-l+i I 

.1=1 



I faj—ai I 



(139) 



ai=l a2,a3,.--5Cij=l 
3/c:|afc — ai|>e~^ 

7V(e) 



A^(e 



iV(e) 



ai=l a2,a-j.,...,aj& 
3k:\aj^—ai\>t~ 



— <if_i_i I 

.1=1 



|/a, — ai I 



I/a, — ai I • 



With the inner sum extended over V ^ in this way, it becomes independent of the outer 
sum index oi as can be seen by the substitution Uk = at — cti for A; = 2, 3, . . . j. Thus 



(140) 



n2,n3,...,njeZ 
3fc:|nfe|>e-'- 



-ne+l I 

.1=2 



\fn 



and the latter upper bound is of course independent of (x, t) with + < and tends to 



zero for r > by Lemma 4.5 



It follows from (137|) that 
(141) 



M 



lim — -tr (D'^i H'^i • ■ ■ D^^^ H^O = lim (e) 



where the diagonally-concentrated terms are 

M 



(142) 



iV(e) 



A^(^) r i 

Vfc:|a,fc — ai|<e~^ 



.i=l 



Hajai • 



We will analyze Z]:,{e) under the additional assumption that r < 1. 



The first step is show that if r < 1 each occurrence of Hnm in (142) may be replaced by 



2iipn^rnfn-m without affcctiug the limiting value of Z-[){e) as e J, 0. Indeed, by making this 



substitution j times in succession each time keeping track of the error using Lemma 4.3 along 



with the estimates (106) and (107) from Lemma 4.2, one sees that with Kji > defined by 



(143) 



Kr: 



k+l 



k=l 
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for all j-tuples of integers ai, . . . , aj between 1 and A^(e) satisfying \ak — ai\ < e ^ for all k, 



(144) 



1=1 



Ha,a, - (2Z) 



.1=1 



■i-i 
.e=i 

■j-l 

(l-r)o-/3 



fa^—ai 



I /a,-— ai I 



Therefore, if we define a modification of ^D(e) by 



(145) 



we have 



{2^yM g 

VA,':|afc-ai|<e"'' 



N{e) 



.4 = 1 



\Zj,{e) - Zl{e)\ < 



MKne 



N{e) 



(146) 



(l-r)<x/3 ^(^) 

- E 

VA::|afc— ai|<e^^ 



faj—ai } 



\faj—ai I 



ai=l \ a2,a3,.--,a)62 



N{e) 



.i=l 



I/aw— ai I 



By the substitution rii = ae — cii one sees that the inner sum is independent of ai, and it is 
finite by Lemma 4.5 Since a > and r < 1, we therefore have 

(147) limZD(6)=limZi,(6) 

uniformly for + < i?^ . 

The second step is to show that if r < 1 we may replace D^^ip"^, with D'^^'(f'^_^ for each i 
in (145) without changing the limiting value of Z^^^e). Indeed, applying Lemma 4.2 we see 
that with K]^ > defined by 



(148) 



we see that for all j-tuples of integers ai, . . . , between 1 and A^(e) satisfying \ak — Oi | < e" 
for all k. 



(149) 



n/}"^»^2 _j^p-j 2j 



i=l 



Hence, definining a subsequent modification of Z\^{e) by 
(150) 



]V(£) 



ai =1 



a2,a3,...,aj=l 
Vfc:|afc-ai|<e-' 
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J-1 
.1=1 



/a, — ai ; 



we see that 



(151) 



< 



N{e) 



Nie) Nie) 

ai=l a2,a3,...,aj = l 
VA:|aj. — ai|<e~'' 

N{e) / 

ai=l \a2,a:j,...,ai£Z 



.e=i 



Hi/-. 

/=1 



I/a, — ai I 



\faj—ai I 



and so exactly as before 
(152) 



limZ^,(e)=limZ^}(6) 

e4,0 elO 



uniformly for x"^ + < R'^. 

The third step is to show that if r < 1 one may neglect a small fraction of the terms in the 
outer sum corresponding to Oi < 1 + e"** and Oi > A^(e) — without changing the limiting 
value of Z^{e). Indeed, defining the index set 



(153) 

and then setting 



(154) 



S, ■={nEZ, 1 + e"" < n < N{e) - e"''}, 
{2iyM 



Nie) 



a2,as,...aj=l 
Vfc:|afc— ai|<e~' 



n/. 



ae—ae^i 



faj—ai } 



we easily obtain from (106) and (107) in Lemma 4.2 that 

, , 2WC^1C2^- ^ ^ 

\Z'^{e) - ZE\e)\ < ^^.^ ^ Yl E 

^ ' ai=l a2,a3,.--iij=l 

ai^-SE Vfc;|afc-ai|<e-'- 



ni/. 



I/aw— ai I 



(155) 



< 



iV(e) 



E E ^ 

ai=l a2,az,...,aj(i'l 



Hi/-. 

1=1 



But the inner sum is independent of ai and is convergent by Lemma |4.5| and the outer sum 
has 0(e~^') terms while A^(e) is proportional to e~^, so with r < 1 we have 



limZ^J(e) = limZ^,"(6) 



(156) 

uniformly for + < R'^. 

The next step in analyzing Z-Q^e) is to deal with the inner sum in the definition (154) of 
Zp^(e). Taking into account the conditions on ai in the outer sum, it is obvious that the 
conditions 1 < a/c < A^(e) are superfluous in the inner sum: 



(157) 



= ^ E D^'^ E 



Nie) 



a2,a3,...,aj^I 
VA;:|afe-ai|<e- 
32 



faj—ai ■ 



By introducing the differences Uk 
independent of ai: 



Oi it now becomes clear tliat tlie inner sum is 



(158) 



{2iyM 



1 




\ 












frij 


n2,n3,.--)"jSZ 


J=2 





\ Vfc:|nfc|<e- 



Now, according to Lemma 4.5, tlie latter sum has the limit {iiry / {j + 1) as e J, with r > 0, 
so 



(159) 

uniformly for + < i?^ , where 



limZ^,"(6) = limZ^r(e) 

£4-0 e4-U 



(160) 



M 



(27r). 

j + 1 iV(e) 



The final step in the analysis of Z-^le) is simply to evaluate the limit on the right-hand 
side of by recognizing the sum as a Riemann sum for an integral: 



(161) 



liHiZD(e) = limZ^)^(e) 



40 



£4,0 



(270 

J + IJ- 



D{\;x,tY-^ip{\f^F{\) d\. 



Note that since the summand D^^^ipj^^_^ is polynomial in x and t, the convergence of the 
Riemann sum is uniform for (x,t) in compact sets. Comparing with (141) we see that the 
proof is complete. □ 



Now we may complete the proof of Proposition |4.1[ Lemma |4.6| shows that each of the 
terms in the formula (120) for Zpj has the same limit as e J, 0. Therefore, for all even j, 

fO 

D(A;x,t)P-V(A)^^F(A) d\ 



(162) 



limZr,,- = \ - — ^— 
eiO ^ 7 + 1 

di+d2+---ds=p-j 

hi+h2-\ \-hs=j 



V\ (2vr)J' 



J J J + 1 







D{\-x,tf-^^{\f^F{\) d\. 



Combining this result with Lemma 4.4 and the formula ( 119[ ) for the p*^ moment, we obtain 
(163) Q, = lim^a^d/x.(a) = g {j^^^^ j\{\-x,tY-''^^{\Y'F{\)d\ 



uniformly for (x, t) in compact sets. Now we apply the identity 



(164) 



Lp/2J 

E 

fc=0 



1 



2fc + 1 V2A; 



{b + a 



(b-a) 



2a{l+p) 



holding for any integer p > and real numbers a and b. (This identity can be most easily 
obtained by expanding the binomials on the right-hand side.) Recalling the definitions (78) 
and (79) of D{X]x,t) and v^(A), and using the fact that x±{\) = ±nF{X) — 7(A) then 



completes the proof of Proposition 4.1 
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4.3. Convergence of measures and locally uniform convergence of If^. Proof of 
Proposition 4.2, Recall the measures fi^ defined by (94). 



Lemma 4.7. For each nonnegative integer p, 



(165) 



lim / d^^{a) 

e4,0 



where /i is the absolutely continuous (with respect to Lebesgue measure on measure defined 
by dix{a) = G{a;x,t) da, and the compactly supported integrable density function G{a;x,t) 
is given by (100). The limit is uniform with respect to {x,t) in compact sets. Also, like each 
fie, is a measure with mass M. 



Proof. Recalling Proposition 4.1, we first show that the given measure /i satisfies 



(166) 



/ dfi{a) = Qp, 
Jr 



where Qp is given by (97), for all nonnegative p G Z. Equivalently, we may construct a 
measure with the desired moments as follows: the characteristic function of the measure /x 
is the Fourier transform 

(167) G{^;x,t):= [ G{a; x,t)e-'''^ da, 

and this function necessarily has the desired moments {Qpj^o derivatives at ^ = 0: 



(168) 

So G{C,;x,t) has the Taylor series 
(169) 



dPG 



{^■x,t) = {-iYQ, 



G{i-x,t) = Y,^-^Qp. 



p=0 



p\ 



Now from the obvious inequality \x + 2\t — x-|-(A)| <\x — Xq\ + 2L\t\ + 27rF(A), we obtain 
(170) 



\Qp\ < 



7r(p + l) 



(Ix-xol + 2L|t| + 27rF(A)) 



p+i 



-2XYd\ 



-L 




< 



7c{p 



1 r 

— - / {2L\x - xo\ + 4L2|t| - 47tXF{X)Y{\x - xo\ + 2L\t\ + 27rF(A)) dX. 

+ 1) J-L 



Also, from Lemma 3.1, there is a constant K > such that < — A-F(A) < K, so for 

{X - Xof + t^< 

{2LR + AL'^R + A'nKy ''^ 



\Qp\ < 



(171) 



< 



7l{p + 1) 

{LR + 2L'^R + 2ttM) 



J {\x-xo\+2L\t\+27rF{X)) dX 



n{p+l] 



{2LR + AL'^R + AnKY 



1 



< -{LR + 2L'R + 2'nM)(2LR + AL^R + A-kKY, 
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where in the last step we used (67). This inequahty imphes that the Taylor series (169) 
converges for all ^ G C to an entire function of exponential type. 

Now we will sum the Taylor series ( 169 ) in closed form by substituting from the formula 



(97) and exchanging the order of summation and integration. Indeed, since 
{-iO^ (-2A)P(x + 2At-x±(A))P+i e2mx+2xt-x±{x)] _ 



^^^^^ ^ p! p + 1 2ii\ 

we obtain the formula 

/O 2iiX[x+2Xt-x-{X)] _ 2iiX{x+2Xt-x+{X)] 
, 

Computing the inverse Fourier transform 

(174) G(a; x,t) = ^ [ G{^; x, t)e^"« 



by exchanging the order of integration leads directly to the claimed formula (100) 



It is obvious that G{a; x,t) is a nonnegative function, and since by Lemma 3.1 



(175) inf -2A(x + 2At - s+(A)) > -oo and sup -2A(2; + 2At - x_(A)) < +oo 

-L<X<0 -L<X<0 

for every {x,t), it is clear that G{a;x,t) has compact support. It is also straightforward to 
verify that has mass M: 



dfi{a) = I G{a;x,t)da 

1 f . .dX 



Air 




X[-2X{x+2Xt-x+(X))~2X{x+2Xt-x^{X))]i0!) — da 
L ^ 



1 r 1 



(176) 47r J_L A 

1 I r~2X(x+2Xt-X-{X)) 



X[~2X{x+2Xt-x+(x))-2X{x+2Xt-x-(x))]{.(y) da dX 
dadX 

A J ^2X{x+2Xt-x+{X)) 



F(A) dX 

■L 

= M, 

according to ( [67) ). Therefore fx is indeed an absolutely continuous compactly supported 
(nonnegative) measure of mass M. □ 

Note that the reconstruction of the the measure fi from its moments is virtually the same 
calculation as took place on the direct scattering side in our discussion of Matsuno's method 



in ^3.2 



Lemma 4.8. There is a compact interval C M containing the support of all of the measures 
{fj,e}e>o as well as that of the measure fi, and Q may be chosen independent of (x,t) in any 
given compact set. 
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Proof. Since /x has compact support certainly contained within the interval 



(177) inf [2Xx+{X)]-2L\x\-AL'^\t\ <a < sup [2Xx4X)] + 2L\x\ + AL'^\t\ 

-L<\<0 -L<A<0 

that is clearly bounded uniformly for {x,t) in any compact set, it is enough to show that 
the support of /i^ is uniformly bounded as e | 0. But by definition of /i^ this is equivalent to 
showing that the eigenvalue of with the largest magnitude remains uniformly bounded 
as e J, 0. 

Since the matrix is Hermitian, we have 



(178) IIAglU = max \a 

l<j<N(e) ■ 



SO to prove that the eigenvalue of A^ with the largest magnitude remains uniformly bounded, 
it is completely equivalent to prove that the i"^ (induced) matrix norm of A^ is uniformly 
bounded as e J, independent of (x,t) in any given compact set. 



Recalling the decomposition A^ = D + H from the proof of Proposition 4.1 given in ^4.2 



the triangle inequality gives ||A£||2 < ||D||2+ ||H||2, and since D is diagonal, 

IIDIU = max |2A„(a; + 2A„t + 7(A„))| 

l<n<Nie) 

fl79^ < sup |2A(x + 2At + 7(A))| 

^ ^ -L<X<0 

< sup |2A7(A)| +2L|a;| +4L2|t|, 

-L<A<0 



SO since A7(A) is bounded according to Lemma 3.1, and H is independent of x and t, it is 
sufficient to show that ||H||2 remains bounded as e J, 0. 

To estimate ||H||2, we write H in the following form: H = BTB + E where 

(180) B = diag (^e^-/y-2AiF(Ai), . . . , e'^/'^ -2Xr,i,~^FCXNie))^ , 

and T is the A^(e) x A^(e) Toeplitz matrix with elements Tnm = fn-m, where the sequence 
{/nW is defined by (|T24|). Of course E := H- BTB. Therefore ||H||2 < ||B||2||T||2+ ||E||2. 
Because B is diagonal, 

(181) \\B\\l < max [-2A„F(A„)] < sup [-2AF(A)] 

l<n<Af(e) -L<A<0 



which is finite by Lemma 3.1 The Toeplitz matrix T can be written as T = VTfV, where 
V is the orthogonal projection from £^(Z) onto viewed as a subset of £^(Z) associated 
with components having indices {1, 2, . . . , A^ (e)| C Z, and where 7/ : £^(Z) — t- £^(Z) is the 
Toeplitz operator defined by (123) from ^4.2 The £^(Z) operator norm of V is clearly equal 



to one, and since 

(182) ^/;e^'^ = i(7r-^), < < 27r, 
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the Pythagorean Theorem in L^(0, 27r) gives 

f.27r 



neZ 



27r 



27r 



(183) 



27r 7o 



27r 

1 

2^ 



Cm 6 



27r 



27r 



(tt 



neZ 

EE/" 

r 

E 



de 



^i[n—m]d 



meZ 



neZ 



meZ 



27r 



27r 



E 



Cm C 



AmO 



mGZ 



-^Ei 

mez 



c P 



the £^(Z) operator norm of 7/ is bounded by tt. It follows that ||H||2 < 7r + ||E||2, so it 
suffices to show that ||E||2 remains bounded as e J, 0. 

So far, we have exploited the special structure of the dominant parts of the matrix and 
applied correspondingly specialized norm estimates to these terms. The error term E has 
less structure, but is it smaller; to estimate its norm it will be sufficient to use the rather 
crude inequality ||E||2 < ||E||hs and work with the Hilbert-Schmidt norm 



N{c) N{e) 

(184) llEllls E E 1^' 

n=l m=l 

where the elements of E are explicitly given by 



2 

nm\ ) 



(185) 



r, 



n — m 



, for n 7^ m, and E^n — 0. 



If we introduce continuous variables a {n — \)e and b :— {m — |)e, then it is easy to 
see that the square of the Hilbert-Schmidt norm of E is a Riemann sum approximation of a 
certain double integral: 

(186) lim||E||^s = // eo{a,b)dadb, 

J J[Q,M? 

provided the double integral exists, where 

^m-i(a)m-i(6) ■^m-^{a)F{m-^{a))m-^{b)F{m-^{b)) 



(187) eo(a,6):-4 



m~^{a) — m~^{b) 



a — b 
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and where m 



(1^ 



denotes the inverse function to the monotone function m(-) given by 

r-A 



m(A) 



F(A') d\'. 



-L 



By changing variables to k 



m 



-1^ 



and A 



Co (a, b) da db 



[0,M]2 



-L,0P 



e(K, A) dn dX, 



where 



(190) 



e(K, A) := 4 



^ v//€F(/t)AF(A) 



K — X m^K) — m(A) 



F(k)F(A). 



Note that since -F > by Lemma 3.1, e(/€, A) > for (k, A) G [— L, 0] . To complete the 



proof of the Lemma it is enough to show that the double integral on the right-hand side of 



(189) is finite. 



In order to estimate the double integral, we divide the square [— L,0]^ into polygonal 
regions as follows (see Figure [?]): 

• The square [—L, —L + 5]^ contains those ordered pairs (k, A) for which both k and A 
are near the "soft edge" of the eigenvalue spectrum at — L. We divide this square into 
diagonal and off-diagonal parts according to whether (k + L)/2 < A + L < 2(k + L) 
(the diagonal part, 6*0) or not (the off-diagonal parts, Sod)- 

• The square [—5,0]^ contains those ordered pairs (k. A) for which both k and A are 
near the "hard edge" of the eigenvalue spectrum at 0. We divide this square into 
diagonal and off-diagonal parts according to whether 2k < A < k/2 (the diagonal 
part, Hji) or not (the off-diagonal parts Hqd). 

• The remaining part of [— L,0]^ contains those ordered pairs {k, A) for which at least 
one of the coordinates lies in the "bulk" of the eigenvalue spectrum, bounded away 
from both edges. This is divided into a diagonal part Bd and two off-diagonal parts 
Bod along two straight line segments parallel to the diagonal as indicated in Figure [7| 



Here, the constant 5 > is as specified in Lemma 3.1 As e(/t. A) = e(A, k) it will be enough 
to show integrability of e over the part of [—L, 0]^ with /t < A, an inequality that we will 
assume tacitly below. 

First we consider integrating e(K, A) over the "off-diagonal" shaded regions S'od, -Bod, and 
iJoD shown in Figure [7j An upper bound for e{K, A) useful in these regions is easily obtained 
from the inequality {a — b^ < 2a^ + 26^: 

1 F{k)F{X) 



(191) e{K,X)<8KF{K)XF{X) — + 

[K — Aj^ [■m[K) — m(Ajj^ 

Applying the Mean Value Theorem to this estimate yields 



{k,x) g {-L,oy 



(192) 



e{n, A) < 



IkF{k)XF{X) 



A)^ 



where k < < X. Finally, since F is monotone increasing according to Lemma 3.1 we obtain 



(193) e(K,A)< 



8kF{k)XF{X) 



A)^ 



1 + 



F(A) 



8kF{k)XF{X) 8kXF{X) 



A)^ 



{K-xy 



F{X). 
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(0,0) 





TT jn"^ / 

-nOD / 


^OD / 


Hd / 
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Sob / 
/ Sd 
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/ 'S'OD 





-L,-L) 



Figure 7. The square [—L, 0]^ in the (k, X)-plane is covered by the six regions 
Sd, Sod, Bd, Bqd, Hd, and Hqd- 



Now, for (k, A) G -Bod, we have that k — X is bounded away from zero while by Lemma 3.1 
k,F{k) and AF(A) are bounded (and of course \k\ < L) while F{X) is integrable. Hence we 
easily conclude that e(/t, A) is integrable on -Bqd- 

If (k, A) G Hqd with k < X, then we have the inequality 

,2 



(194) 



{K-xy 



X- 



K 



2 



and also since both —S<k<0 and — 5 < A < we may use the upper bound for F given 
in (82) from Lemma 3.1 to replace ( |193 ) with 



(195) 



e(K, A) < 32C^ 



-A) 



l-2/g 



where Co > and g > 1 are the constants in (|82j). This estimate is easily seen to be integrable 
on the component of Hqd with k < A by direct calculation of the iterated integrals. 
If (k, A) G S'OD with K < A, then we have the inequality 

2 



(196) {k - 

and also since both 



A)^ 





'X + L 




( 


+ 




2 



X + L 



[k + L) 



> 



{X + Lf 



-L < K < —L + 6 and —L < X < —L + 6 we may use the upper bound 



for F given in (80) from Lemma 3.1 along with the inequalities \k\ < L and |A| < L to 
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replace (193) with 

(197) e(/€, A) < ?,2L'C\{^i + Lf'^{\ + L)-""^ + ?,2L^C\{>^ + L)-\ 

This upper bound is obviously integrable on the component of S'qd with k < \. 

Now we consider integrating e(fi;, A) over the "diagonal" unshaded regions S-o, Bd, and 
Hj) shown in Figure [7} By the Mean Value Theorem and the monotonicity of F guaranteed 
by Lemma 3.1, we obtain an upper bound more useful when k ^ X: 

2 



(198) 



e(K,A) < 4:kF{k)XF{\) 



FiK) - F(A) 



m^K) — m(A) 

Again using the Mean Value Theorem and monotonicity of F we may make the upper bound 
larger for k < A: 

AKXF{\)F\if 



(199) 



e(K, A) < 



F(k) 



where /t < ^ < A. 

For (k, A) G -Bd with /t < A, both /t and A are bounded away from the soft and hard edges 
of the eigenvalue spectrum, so Lemma 3.1 guarantees that F and F' are bounded, and F is 
also bounded away from zero by strict monotonicity and the boundary condition F{—L) = 0. 
It follows from (199) that e{K, A) is bounded and hence integrable on Bj). 



If (k. A) G i^D then we may use the estimates (82) and (83) from Lemma 3.1 to replace 
( pjgj ) with 

(200) e(«:. A) < ^(-«:)i+V.(_A)i-V^(-e)-2/^-^ < 

The double integral of this upper bound over the region Hd with k < A is easily computed 
by iterated integration and is clearly finite as a consequence of the fact that g > 1. 

Finally, if (k. A) G with k < A, then we may use the estimates (80) and (81) from 
Lemma 3.1 together with the inequalities < L and |A| < L to replace (199) with 

(201) e{K, A) < 2L^CIl{k + L)-^/\\ + Lf/\^ + L)-^ < 2L^CIl{k + LY'"\\ + LY'\ 

an upper bound that is clearly integrable over the part of with k < A. □ 

Lemma 4.9. The measure fi^ converges in the weak-* sense to /x, uniformly for {x,t) in 
compact sets. That is, for each continuous function / : M — )■ C, 



(202) 



lim 



/(a) dfi^{a) 



with the limit being uniform with respect to (x, t) in compact sets. 



Proof. According to Lemma 4.7, for each polynomial p{a) we have the following limit, uni- 
form for (x, t) in compact sets: 



(203) 



lim / p{a)d^^{a)= / p{a)d^{a). 
'^^ Jr Jr 



But by Lemma 4.8 we can equivalently integrate over the compact interval $7 (independent of 
{x, t) in any given compact set) with the same result. Now by the Weierstrafi Approximation 
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Theorem, given any continuous function / : M — t- C and any p > there is a polynomial 



pl{<y) for which 



(204) 



sup|/(a) -pUa)] < 



P 



so for any measure u of mass M with support in Q (like and /i), 



(205) 



[/(") 



< / \f{(^) -Pi{a)\du{a) < p. 



Let w > be an arbitrarily small positive number. Then if we write 

(206) u[g] := / gia)duia), 

we have 
(207) 

/(a) (i/ie(a) — / f{a)dn{a) 



< 



< 



\Mf]-m 
p^ipL/sI - p[pi/3] 



+ Pe[f 



p[f - pi 



pAf -p! 



/ 



w/3J 



Pe[p! 



w/3J 



fl[pl 



w/3J 



with the last inequality following from (205). But with to > fixed, (203) implies that e > 
may be chosen sufficiently small, independently of {x,t) in any given compact set, that 



(208) 

which implies 
(209) 

thereby completing the proof 



pApL/sI-pIp 



ui/3i 



1 



f{a)dfi^{a)— / f{a)dfi{a 



□ 



Now we are in a position to complete the proof of Proposition 4.2 We begin by writing 
Ue{x,t) as defined by (93) in terms of the normalized (to mass M) counting measure p^: 

(210) U,{x,t)= /"2arctan(e~^a)rf/ie(«). 
Define the continuous functions 

(211) a+(a) := vr + 4if(— a) arctan(a), a_(a) := — a+(— a), a G M, 

where H{-) denotes the Heaviside step function. It is then easy to check (see Figure [s]) that 
for any E > 0, 



(212) 



0<e< E 



a^(E ^a) < 2arctan(e '^a) < a^{E ^a)^ a G 
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-4 -2 2 4 



Figure 8. The graphs of a^{a) < a+{a) (black) and several graphs of 
2 arctan(e^^a;) fore < 1 (gray). 

Therefore, for any E > and all < e < ii^, 

(213) / a^{E^^a) dfi^{a) < / 2 arctan(e~^a;) (i/ie(a) < / a+{E~^a) dfieict). 

Jm. Jr Jr 



Using Lemma |4.9| we may pass to the limit e J, in the lower and upper bounds to obtain 

(214) liminf / 2 arctan(e~"'^a) (i/ie(a) > / a_(E~^a) djj.{a) 

^-1-0 Jr Jr 

and also 
(215) 



limsup / 2arctan(e ^a)d^^{a)< / a^{E ^a)dfi{a). 
eio Jr Jr 



In these statements, > is an arbitrary parameter, and the limits are uniform for {x, t) 
in compact sets. But a±{E^^a) are uniformly bounded functions that both tend pointwise 
for a 7^ to the same limit function 7rsgn(A) as E I 0, while yU is a fixed measure that is 
absolutely continuous with respect to Lebesgue measure on M, so by the Lebesgue Dominated 
Convergence Theorem, 



(216) 



lim / a±{E ^a)dfi{a)= / 7rsgn( 
■^J-O Jr Jr 



a) dfi{a) 



By letting E | 0, it then follows from ( |214| ) and ( pl5| ) that 
(217) 



lim / 2arctan(e ^a)d^^{a)= / n sgn{a) d^{a) 
'^^'^ Jr Jr 



with the limit being uniform for {x,t) in any given compact set. Finally, according to (71) 
we have (independent of x and t) 

(218) lim^ = l, 



proof of Proposition 4.2 



so combining this result with (217) and noting that dfi{a) = G{a;x,t)da completes the 
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4.4. Differentiation of U^. Burgers' equation and weak convergence of u^. Let 

G ^(M) be a test function. Tlien by integration by parts and tlie uniform con verg ence of 
Ue{x,t) to U{x,t) on compact sets in tlie (x,t)-plane guaranteed by Proposition 

■Ue(x, = lim / -—^{x,t)(f){x)dx 

e4,0 



4.2 



lim 

£4-0 



dx 



(219) 



lim 
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U^{x, t)(j)'{x) dx 
U {x, t)(f)'{x) dx. 



Lemma 4.10. The limit function U{x,t) is continuously differentiable with respect to x, and 



if {x,t) is a point for which there are 2P{x,t) + 1 solutions Mq (x, t) < 



< u 



2P{x 



,t)i^,t) of 



the implicit equation (224), 
(220) 



dx 



X, t) 



2P{x,t) 
n=0 



\n„.B, 



n 



X, t), 



and the above formula is extended to nongeneric {x, t) by continuity. 

Proof. Exchanging the order of integration in the double-integral formula for U{x, t) obtained 
by substituting dfj,{a) = G{a; x,t) da with G given by (100) into (99), we obtain 



(221) 
where 
(222) 



U{x,t) 



J(A; X, t) d\, 



p-2X(x+2\t-x^{X)) 

J{X;x,t) := — -pr / sgn{a)da. 

J-2X{x+2Xt-x+{X)) 

Note that for A G [—L, 0] the upper limit of integration is greater than or equal to the lower 
limit. Moreover the integral in J(A; x,t) is easily evaluated; for —L < A < 0, 

{-7rF(A), x + 2At-2;_(A) < 0, 

a; + 2At + 7(A), x + 2\t - x+(A) < < x + 2At - x_(A) 
7rF(A), x + 2At-x+(A) > 0. 

It follows from the relations x-i-(A) = ±7rF(A) — 7(A) that for an admissible initial condition 
Uq, J is a continuous function of x for each fixed t, uniformly with respect to A G [— L,0], 
and hence also from (221) that U{-,t) is continuous on M for each t. To prove that U{-,t) is 
continuously differentiable it will therefore suffice to establish continuous differentiability on 
the complement of a finite set of points and that the resulting piecewise formula for dU /dx 
extends continuously to the whole real line. 



To use the formula (223) in the representation (221) we therefore need to know those 
L, 0) at which one of the two quantities x + 2Xt — x+(A) < x + 2\t — x_(A) 



points A G 

changes sign. Under the variable substitution A 



-u 



the definition of the turning points 



x-i-(A) as branches of the inverse function of Uq implies that the union of solutions of the two 
equations x + 2\t — x-i-(A) = is exactly the totality of solutions of the implicit equation 



(224) 



= uo{x- 2u^t). 
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In other words, the transitional points A for the formula (223) correspond under the sign 
change = — A to the branches of the multivalued solution of Burgers' equation 



(225) 







dt dx 

subject to the admissible initial condition u^{x,0) 

Note that admissibility of Uq implies (see Definition 3.1) that given any t G 
only a finite number of breaking points 



Uo[X) 



there exist 



[Xt:,t^) 



tf) with in the closed interval between and 



t. Indeed, the breaking points correspond to values of ^ G M for which Uq{^) = but 
u'q{^) 7^ 0, and the breaking times are = {—2u'q{^))~^ ; since u'q{^) decays to zero for large 
^, bounded breaking times correspond to bounded ^, and there are only finitely many 
of these by hypothesis. Moreover, each breaking point {x^,t^) generates a new fold in the 
solution surface lying between two caustic curves emerging in the direction of increasing \t\ 
from {x^,t^), and because m'"(0 there are exactly two more sheets of the multivalued 
solution of Burgers' equation born within the fold as a result of a simple pitchfork bifurcation. 
Therefore, the union of caustic curves and breaking points meets any line of constant t in 
the (x, t)-plane in a finite set of points {xj'''*(t)}, and on every connected component of the 
set St := {{x,t)\x G M \ {,T""'*(t)}}, there is a finite, odd, and constant (with respect to 
x) number 2P{x,t) + 1 of roots of the equation (224), and all roots are simple (and hence 



differentiable with respect to x). 

If t > 0, then by admissibility of Uq the quantity &_(A; x, t) := x + 2Xt — x_{X) is strictly 
increasing as a function of A on the interval {—L, 0), and therefore in this interval there can 
exist at most one root of 6- (A; x, t), regardless of the value of x G M. Moreover, 6_(A; x, t) 
+00 as A t 0, so there will be exactly one root in {—L, 0) if b-{—L; x,t) = x — xq — 2Lt < 
and no root in {—L, 0) if x — xq — 2Lt > 0. Since 6+(A; x, t) := x + 2\t — x+(A) < 6-(A; x, t) 
for —L < A < 0, if X — xo — 2Lt < 0, all roots of 6+(A; x, t) in {—L, 0) must lie to the right 
of the root of &_(A; x, t). Thus, for x & St\ {xq + 2Lt}, we either have 



U{x,t) 



(x + 2At + 7(A))rfA 



(226) 



P{x,t) 

p=i 
+ n 



TX 



F{X)dX + 



^2p 



(x + 2At + 7(A))rfA 



F{X) dX, X e St, X > xo + 2Lt, 



in which case Mq (x, t) < ■ ■ ■ < u^pi^^ ()(x, t) are all roots of 6+(— m^; x, t), or 



U{x,t) 



(x + 2At + 7(A))c/A 



(227) 



P{x,t) 

p=i 



7r 



-'2p-2 



F{X)dX + 



^2p-l 



-'2p-l 



^2p 



(x + 2At + 7(A))rfA 



— TT 



2P(x,t) 



F(A) dX, X e St, X < Xo + 2Lt, 
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in which case UQ{x,t) < ■■■ < u^pf^^ ^•^_'^^{x,t) are roots of b+{—u^;x,t) while U2p(^f){x,t) 



with U2P(xt)(^'^) ^ '"2P(a; ^) ^ t). In both cases, the condition 

X & St guarantees that all roots are differentiable with respect to x, so we may calculate 
dU/dx by Leibniz' rule: 



(228) 



or 



dx 



x,t) = h+{-u^p{x,t)]x,t)-^\ 



2P-1 



dul 



X 



t)+Y.^-lTh^{-ul{x,t)-x,t)''-^{x,t) 



n=0 



2P 



+ Y^{-lTul{x,t), xeSt, x>Xo + 2Lt, 



n=0 



(229) 



dU ^ 
dx 



x,t) 



b-{—U2p{x, t);x, t) ^^^ (x, 



2P-1 



dul 



dx 



t) + J2i-'^rb4-u^ix,ty,x,t)^ix,t) 



n=0 



2P 



+ ^(-l)"M|^(x,t), X e St, x<xo + 2Lt, 



n=0 



where in both cases P = P{x, t) is a constant nonnegative integer on each connected compo- 
nent of St- The terms on the first line in each of these formulae arise from differentiating the 
limits of integration and using x±{X) = ±ttF{\) — 7(A), while the terms on the second line 
arise from the explicit partial differentiation of the integrand x + 2Xt + 7(A) with respect to 
X. It follows from our division of the solutions of ( 224[ ) among the roots of 6+ and 6_ that 
in both cases the terms on the first line vanish identically, with the result that 



(230) 



dU 
dx 



x,t) 



2P(x,t) 
n=0 



l)"<(a;,t), X e St\{xo + 2Lt}. 



This expression is clearly continuous in x on each connected component of St \ {xq + 2Lt}. 
Moreover, it extends continuously to the finite complement in M^. (at fixed t > 0) because at 
caustics pairs of solution branches entering into (230) with opposite signs simply coalesce. 



Therefore U{-,t) is indeed continuously differentiable for t > and its derivative is given by 
the desired simple formula (220). Virtually the same argument applies to t < with the 
roles of &±(A; x, t) reversed, and the resulting formula for dU /dx is the same. □ 



It follows from this result that we may integrate by parts in (219) and obtain 

dU 



(231) 



lim 

£4-0 



Ue(x, t)(j){x) dx 



dx 



(x, t)(j){x) dx 



for every test function e ^(M). Now let v e L^(]R). Since ^(R) is dense in 
each a > there exists a test function 0o- G ^(M) such that 



for 



(232) 



|2 . 
I2 ■" 



fx) — v(x)\'^ dx < 
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Then, 



dU ^ 
dx 



X, t) 



v{x) dx 



ix) dx 



(233) 



+ 



^1 

dx 



X,t) — v{x)] dx 



Ue{x,t) [0(t(x) — v{x)] dx. 



Observe that, according to the definition (see Definition 3.2) of u^{x,t) in terms of the 
modified scattering data, it follows from (47) and (46) that 



(234) 



/ Uejx, tY dx = —Ane y ^ A 

n=l 



This Riemann sum converges as e J, 0: 



(235) 



lim 

£4-0 



Ue{x, ty dx 



-An 



\F{\)d\ = / uoixfdx 



where the second equality follows from the identities (57), which essentially define F{X) in 
terms of the admissible initial condition uq. Therefore, ||'Ue(-,t)||2 is bounded for sufficiently 
small e, independently of t. 

Also, dU/dx is independent of e and from the formula (220) it is easy to check that it is 
positive and bounded above by the constant L for all {x,t). Therefore 

2 



(236) 



dx 



< L 



dU ^ 
dx 



X, t) dx. 



By the formula (220), the latter integral is equal to the area between the graph of the 
multivalued solution curve for Burgers' equation and the x-axis. Since points on the graph 
at the same height move with the same speed, this area is independent of time t, and hence 
we have 

2 



(237) 



dx 



< 2nLM, 



where the mass M is defined in terms of the initial condition by ( 52 ) . In fact, for < t < T, 
where T is the breaking time, it follows from the fact that dU/dx as given by (220) reduces 
to the classical solution u^{x,t) of Burgers' equation with initial data uq, which conserves 
exactly the L'^{M.x) norm, that 

2 



(238) 



dU 
dx 



\u 



uo{xY dx, < t < T. 



We will use this fact below in ^when we prove Corollary lA In any case, these considera- 
tions show that for all e > sufficiently small there exists a constant K > independent of 
t such that 

dU , , ,,~ / M, 

.t) +\\u,{-,t)\\2<K 

2 



(239) 
holds for alH > 0. 



dx 
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Now, by Cauchy-Schwarz it follows that 
dU 



(240) 



dx 



X, t) [(pa^x) — v{x)] dx — I t) [4>cf{,x) — v{x)] dx 



<KU,~v\ 



Given w > arbitrarily small, we then choose cr = uj/{2M) and then (233) implies that 



(241) 



Ue{x, t) 



dU ^ 
dx 



X, t) 



v{x) dx 



< 



Ue{x, t) 



dU ^ 
dx 



X, t) 



4>u/{2M){x) dx 



UJ 

2' 



Finally, since (f)ui/{2M) is a test function independent of e, we may use (231) to choose e > 
so small that the first term on the right-hand side is less than u/2. 
This proves that 



(242) 



Wx-nmue[x,t) = —{x,t) 

eiO ox 



(weak L convergence) uniformly for t in bounded intervals. Combining (220) with (242) 
completes the proof of Theorem 1.1[ 

5. Strong Convergence Before Breaking 



In this brief section we give a proof of Corollary 1.1, following closely Lax and Levermore 
(see Theorem 4.5 in part II of pO]). Starting from the identity 



(243) \\u,i;t)-U^i;t) 



Ue{x,t)^dx+ / (x, t)^ (ix — 2 / u^{x,t)v^{x,t) dx 



we note that for < t < T, where T is the breaking time, (235) and ( 238[ ) imply that 
(244) lim||M,(-,t) -Mo(-,t)||2 = 2 / u^^xf dx - 2\im I u,{x,t)u^{x,t) dx. 



But UQ(-,t) G L^(]R) is independent of e, so by Theorem 



1.1 



(245) 



lim 

£4-0 



(x, t)^ c/x = / U{){xY dx 



Ue{x, t)Mo (x, t) dx = 

with the second equality following from ( 238[ ) for < t < T. Therefore 
(246) lim||«,(-,t)-M^(-,t)||2 = 

as desired, and the proof is complete. 

6. Numerical Verification 

To illustrate the weak convergence of Ue{x^t) as guaranteed by Theorem |1.1[ and to at- 
tempt to empirically quantify the rate of convergence, we have directly used the exact formula 
(93) for Ue{x,t) having first chosen the modified scattering data corresponding to the admis- 
sible initial condition uo{x) = 2(1 + x^)"^ as specified in Definition 3.2, and compared the 
result for several different values of e with the limiting formula (99) for U{x,t). Our results 
are shown in Figure |9] These plots clearly display the locally uniform convergence specified 
in Proposition |4.2[ An interesting feature is the apparent regular "staircase" form of the 
graph of Ue{x, t) as a function of x; that the steps have nearly equal height is a consequence 
of the fact that near the leading edge of the oscillation zone for (which lies approximately 
in the range 4 < x < 16 in these plots) the undular bore wavetrain that is generated from 
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Figure 9. Left: plots ofU^{x,t) (black) and its locally uniform limit U{x,t) 
(red) at t = 4 for various values of e. For these plots, uo{x) := 2(1 + x"^)'^. 
Right: corresponding plots of the error U{x,t) — Ue{x,t). 
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the smooth initial data resolves into a train of solitons of the BO equation, each of which 
has a fixed mass proportional to e (independent of amplitude and velocity). 

To the eye, the size of the error between lf^{x,t) and U{x,t) appears to scale with e. To 
confirm this more quantitatively, we collected numerical data from several experiments, each 
performed with a different value of e at the fixed time t = 4. The supremum norm, calculated 
over the interval — 10 < x < 20, of the error resulting from each of these experiments is 
plotted in Figure ITO] On this plot with logarithmic axes, the data points appear to lie along 



o 

!-l 



bjO 
Q 




-2.5 



-1.5 

logio(e) 



-0.5 



Figure 10. Circles: logio(||f/.(-, 4) - f/(-, 4)|U) /or e 
1/40, 1/45, 1/50, and 1/100, as a function of log]^Q(e) 
squares linear fit. 



1/25, 1/30, 1/35, 
In red: The least- 



a straight line, and we calculated the least squares linear fit to the data to be given by 



(247) 



logio(||f/e(-,4) - U{-A)\\oo) = 0.988 logio(e) + 0.523 



where the slope and intercept are given to three significant digits. This strongly suggests a 
linear rate of convergence, in which the error is asymptotically proportional to e as e 4- 0. 

The initial data Uq{x) = 2(l + a;^)~^ was chosen for these experiments because it is the only 
initial condition (up to a constant multiple) for which the exact scattering data is known 
for a sequence of values of e tending to zero. This is the result of a calculation of Kodama, 
Ablowitz, and Satsuma [IH], who showed that if Uo{x) = 2(1 + x^)~^, then the reflection 
coefficient /3(A) vanishes identically if e = 1/A^ for any positive integer A^. Moreover, there 
are in this case exactly N eigenvalues Ai < A2 < ■ ■ ■ < Aat of the operator C defined by ([6]), 
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and they are given implicitly by the equation 

(248) (^'^] = Ln (-2iVA„) = 0, 



where Lat is the Laguerre polynomial^ of degree N. The corresponding phase constants 7„ 
all vanish exactly. The approximate eigenvalues determined from the initial condition mq 



via the formula ( 72 ) do not agree exactly with the scaled roots of the Laguerre polynomial 
of degree N (although the approximate phase constants agree exactly with the true phase 
constants), so it is a worthwhile exercise to compare the function u^{x,t) as specified by 



Definition |3. 2 with the true solution u^{x^ t) of the Cauchy problem for the BO equation with 
initial data Uq{x) = 2(l + x^)~^. Of course Corollary 1.1 guarantees strong convergence in 



at t = (that is, Me(-,0) is L -close to uo{-)) but this alone does not guarantee that Ue{x,t) 
approximates Ue{x,t) in any sense for t > 0. We made the comparison for several values of 
e > corresponding to a refiectionless exact solution of the Cauchy problem constructec^^ 



from the determinantal formula (39) at the time t = 4, which is well beyond the breaking 
time. Our results are shown in Figure [11} These plots show that the modification of the 
scattering data used to construct u^{x,t) results in a phase shift relative to u^{x,t) that 
is proportional to e, the approximate wavelength of the oscillations in the undular bore 
structure. In particular, u^{x,t) does not remain close to u^{x,t) after the breaking time in 
any strong sense, although is appears highly likely that convergence is restored in the weak 
topology. 

7. Conclusion 

In this paper, we have obtained the first rigorous results regarding the zero-dispersion limit 
of the Cauchy problem for the BO equation. As suggested by the formal mulitphase aver- 
aging of modulated A^-phase wavetrain solutions carried out by Dobrokhotov and Krichever 
[9], the scalar inviscid Burgers' equation and its multivalued solution after wave breaking 
characterizes the limit. 

To analyze the BO Cauchy problem, we used a remarkable formula of Matsuno [22l 123] 
for the density of eigenvalues of the nonlocal operator C appearing in the scattering theory. 



and we have proposed a new asymptotic formula (68) for the corresponding phase constants 
necessary to set up the inverse-scattering problem. We then developed an analogue of the 
Lax-Levermore method [2^ to study the inverse-scattering problem, and we obtained an 



^The asymptotic density of zeros (here scaled by the factor —2N) of the Laguerre polynomials is well- 
known: 

F{X) = -\I^^, -2<A<0, 

a distribution also known in random matrix theory as the Marchcnko-Pastur law. This asymptotic formula 
agrees exactly with Matsuno's formula for F{X) in the case when uo{x) — 2(1 + which gives some 

independent justification for its validity. 

"'in fact this is the numerical method we used to create the plots in Figure [l] This has a tremendous 
advantage over taking a more traditional numerical approach to the Cauchy problem for the BO equation 
(that is, one involving time stepping) since the calculations necessary to find the solution for any two given 
values of t are completely independent, so errors do not propagate (and to find the solution for any given 
time t it is not necessary to perform any calculations at all for intervening times from the initial instant). 



The only source of error in the use of the determinantal formula ( 39 ) , at least if the differentiation is carried 
out explicitly resulting in a sum of iV determinants, is due to round-off. 
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Figure 11. Left: plots of u^{x,t) (black) shown together with Ue{x,t) (red) 
for the initial data uo{x) = 2(1 + x^)^^ shown for several values of e at t = 4. 
Right: The error Ue{x, t) — Ue{x, t). 



explicit formula (100) for a measure fi with density G{a;x,t) that is the BO equivalent of 
the extremal measure in Lax-Levermore theory. By contrast with the KdV case, the formula 
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we obtain for the weak limit from this measure is remarkably simple and explicit, writing 
the weak limit as a signed sum of branches of the multivalued solution of Burgers' equation. 

A useful generalization of the weak limit given in this paper is to consider the weak limits 
of all of the variational derivatives 

(249) ^-=T^' n = 3,4,5,..., 



where the is the n^^ conserved quantity given by (45). Each of the quantities Kn can be 
written in the form 

d ~ 

(250) Kn+2 = ^U,{x,ti,t2,...,tn) 

where after the differentiation ti is set equal to t and all tk for k > 1 are set to zero. Here 



Ue{x, ti, ^2, ■ ■ ■ ,tn) is givcu by (93) with each occurrence of 2t in the expression — 2Afc(x+2Afct) 
occurring in the matrix elements of replaced by 2ti — 3Xkt2 + 4Ap3 — 5Ap4 + ■ ■ ■ + 
(— l)""'"^(n + l)A^~^t„. Formulas for these higher weak limits can also be obtained within the 
framework of our method and will be published in a subsequent paper. 

We are also currently investigating prospects for strengthening the limit after wave break- 
ing occurs. The goal here is to rigorously establish an asymptotic formula for u^{x,t) that 
is valid pointwise for {x, t) in the oscillation zone. Such a formula should accurately resolve 
the microscopic (wavelength proportional to e) oscillations, including finding the phase up 
to error terms that are bounded by a vanishingly small fraction of the wavelength. One ex- 
pects the asymptotic form of the wavetrain to be given by the rational-exponential formulae 
found by Dobrokhotov and Krichever [9] . For the KdV equation such pointwise asymptotics 
have been obtained [8] using the Deift-Zhou steepest descent technique for matrix-valued 
Riemann-Hilbert problems. We are working to extend this kind of methodology to the con- 
text of scalar Riemann-Hilbert problems with nonlocal jump conditions, as occurs in the 
inverse-scattering transform (generally with nonvanishing reflection coefficient) for the BO 
equation. 

It has been recently conjectured by Dubrovin [IQl that near the earliest breaking point 
{x^,t^) the solution of the Cauchy problem for quite general weakly dispersive Hamiltonian 
perturbations of Burgers' equation should exhibit a universal form expressed in terms of 
Painleve transcendents. This conjecture has been confirmed for general initial data for the 
KdV equation (as a particular case of a perturbation considered by Dubrovin) by Claeys 
and Grava [5J. It would be interesting to determine by direct calculation of the solution near 
the breaking point {x^,t^) whether the BO equation should be considered to fall within the 
universality class of equations conjectured by Dubrovin. 
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